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ADENOVIRUS PARTICLES WITH ENHANCED INFECTIVITY OF DENDRITIC 
CELLS AND PARTICLES WITH DECREASED INFECTIVITY OF 

HEPATOCYTES 

Work described and claimed herein was supported by Department 
5 of Defense Prostate Cancer Research Program DAMD1 7-01-1 -0098 

Department of Defense Breast Cancer Research Program DAMD1 7-01-1 - 
0391. The government has certain rights in such subject matter. 
RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. provisional application Serial 

10 No. 60/459,000, filed March 28, 2003, entitled "DETARGETING OF 
ADENOVIRAL PARTICLES AND USES THEREOF", to Daniel J. Von 
Seggern, and to U.S. provisional application Serial No. 60/467,500, filed 
May 1, 2003, entitled "PSEUDOTYPED ADENOVIRAL VECTORS WITH 
ENHANCED INFECTIVITY TOWARDS DENDRITIC CELLS", to Daniel J. 

15 Von Seggern. 

This application also is related to U.S. application Serial No. 
(attorney docket number 22908-1239), filed the same day herewith, 
entitled "PSEUDOTYPED ADENOVIRAL VECTORS WITH ENHANCED 
INFECTIVITY TOWARDS DENDRITIC CELLS," to Daniel J. Von Seggern. 

20 This application also is related to U.S. application Serial No. 10/403,337, 
filed March 27, 2003 and U.S. application Serial No. 10/351,890, filed 
January 24, 2003, to Michael Kaleko, Glen R. Nemerow, Theodore Smith 
and Susan C. Stevenson, entitled "FIBER SHAFT MODIFICATIONS FOR 
EFFICIENT TARGETING". This application also is related to U.S. 

25 provisional application Serial No. 60/350,388, filed January 24, 2002, 

entitled "FIBER SHAFT MODIFICATIONS FOR EFFICIENT TARGETING," to 
Susan C. Stevenson, Michael Kaleko, Theodore Smith and Glen R. 
Nemerow, and to U.S. provisional application Serial No. 60/391,967, 
filed June 26, 2002, entitled "FIBER SHAFT MODIFICATIONS FOR 

30 EFFICIENT TARGETING," to Stevenson, Susan O, Kaleko, Michael, 
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Smith, Theodore and Nemerow, Glen R. This application also is related to 
International PCT application No. PCT/US03/02295, filed January 24, 
2003, entitled "FIBER SHAFT MODIFICATIONS FOR EFFICIENT 
TARGETING," to Michael Kaleko, Glen R. Nemerow, Theodore Smith and 
5 Susan C. Stevenson. 

Where permitted, the subject matter of each of these applications, 
provisional applications and international applications is incorporated by 
reference herein. 
FIELD OF INVENTION 
10 The present invention generally relates to the field of adenoviral 

vectors and the production of such vectors. Targeted and detargeted 
adenoviral vectors are provided. In particular, adenoviral vectors targeted 
to dendritic cells are provided. 
BACKGROUND 

15 The immune system is designed to eradicate a large number of 

pathogens, as well as tumors, with minimal immunopathology. When the 
immune system becomes defective, however, numerous disease states 
result. Immunotherapy is an emerging treatment modality that seeks to 
harness the power of the human immune system to treat disease. 

20 Immunotherapy is designed either to enhance the cellular immune 

response in subjects with diseases characterized by immunosuppression 
and/or to suppress the cellular immune response in subjects with diseases 
characterized by an overactive cellular immune response and/or to mount 
an immune response against pathogens or tumors. Improved 

25 immunotherapeutic protocols are needed. 

In addition, despite the extensive characterization of numerous 
infectious agents and the availability of vaccines, new vaccines are 
needed to protect against or ameliorate diseases, such as tuberculosis, 
malaria (Plebanski et al. (2002) J. Clin. Invest. 7/0:295-301), and a large. 
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number of viruses including human immunodeficiency virus (HIV), herpes 
simples virus (HSV), human papilloma virus (HPV), Epstein-Barr virus 
(EBV), hepatitis C virus (HCV), respiratory syncytial virus (RSV), 
parainfluenza viruses and human metapneumovirus (Letvin (2002) J. Clin. 
5 Invest. 7 70:15-20; Whitley and Roizman (2002) J. Clin. Invest. 1 1 0:1 45- 
151; Murphy and Collins (2002) J. Clin. Invest. 7 70:21-27), caused by 
many clinically-relevant pathogens . Athough vaccines have been 
developed for influenza and anthrax, more effective vaccines to prevent 
or reduce the severity of the diseases caused by these agents are needed 

10 (see, e.g., Palese and Garcia-Sastre (2002) J. Clin. Invest. 7 70:9-13; 
Leppla etal. (2002) J. Clin. Invest. 7 70:141-144; Steinman and Pope 
(2002) J. Clin. Invest. 705:1519-1526). 

Vaccines and immunotherapy have been used to eliminate or a 
wide variety of cancerous cells and tumors to thereby effect treatment of 

15 cancer. Many human cancers are associated with the expression of 
specific proteins, such as tumor antigens, thus providing a means of 
identifying cancerous cells from normal cells, and providing a target for 
immunotherapy. The immune system is capable of recognizing these 
tumor antigens and eliciting an immune response directed against cells 

20 displaying the tumor antigen (van der Bruggen et al. (1991) Science 
254:1643-1647; Sahin etal. (1995) Proc. Natl. Acad. Sci. U.S.A. 
52:11810-11813; Kaplan etal. (1998) Proc. Natl. Acad. Sci. U.S.A. 
55:7556-7561). The identification of these tumor antigens has led to the 
development of vaccine and immunotherapeutic approaches for the 

25 treatment of cancer (Scanlan and Jager (2001) Breast Cancer Res. 3:95- 
98; Yu and Restifo (2002) J. Clin. Invest. 7 70:28-94). 

There, however, remain numerous challenges in the developement 
of effective immunotherapies. These include, for example, a need for 
(i) enhancing antibody and T cell-mediated immune memory, 
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(ii) enhancing T cell responses (CD4+ T helper cells and CD8+ CTLs), 

(iii) establishing mucosal immunity, which is important for vaccination 
against many sexually transmitted diseases, (iv) development of vaccines 
that can diminish the immune response, which is important for the 

5 treatment of autoimmune diseases (Steinman and Pope (2002) J. Clin. 
Invest. 705:1519-1526) and (v) others. 

Most, if not all, adenoviral vector-mediated gene therapy strategies 
aim to transduce a specific tissue, such as a tumor or an organ, or a 
specific cell type, cells as immune cells. Systemic delivery will require 

10 ablation of the normal virus tropism as well as addition of new 

specificities. Multiple interactions between adenoviral particles and the 
host cell are required to promote efficient cell entry (Nemerow (2000) 
Virology 274'A-A). An adenovirus entry pathway is believed to involve 
two separate cell surface events. First, the adenoviral fiber knob 

15 mediates attachment of the adenovirus particle to a target cell through a 
high affinity interaction with a specific cell-surface receptor, which is the 
coxsackie-adenovirus receptor (CAR) for most, but not all, serotypes of 
adenovirus. A subsequent association of penton with cell surface 
integrins a v 0 3 and a v /? 5 , which act as co-receptors, potentiates virus 

20 internalization. Although there are a plurality of adenoviral fiber 

receptors, in addition to CAR, that interact with subgroup B (e.g., Ad3) 
and subgroup C (e.g., Ad5) adenoviruses, both subgroups appear to 
require interaction with integrins for internalization. 

The role of CAR interaction for in vivo gene transfer is not clear. 

25 CAR ablation does not change biodistribution and toxicity of adenoviral 
vectors in vivo (Alemany et al. (2001) Gene Therapy 5:1347-1353; U.S. 
patent application No. 09/870,203, filed May 30, 2001, and published as 
U.S. Published application No. 20020137213). Published studies have 
described conflicting results (Alemany et al. (2001) Gene Therapy 
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5:1347-1353; Leissner et al. (2001) Gene Therapy 8:49-57; Einfeld et ai. 
(2001) J. Virology 75:11284-11291). For example, it has been shown 
that vectors containing an S408E mutation in the Ad5 fiber AB loop yield 
efficient liver transduction in mice, despite having greatly reduced 
5 transduction efficiencies on cells in culture (see, Leissner et al. (2001) 
Gene Therapy 8:49-57). In contrast, vectors containing a more extensive 
fiber AB loop mutation showed a 10-fold reduction in liver gene 
expression (see, Einfeld et al. (2001) J. Virology 75:11284-11291). 

A doubly ablated adenovirus has been prepared by modifying the 

10 CAR binding region in the fiber loop and the integrin binding region in the 
penton base (Einfeld et al. (2001) J. Virology 75:11284-11291). This 
doubly ablated adenovirus, lacking CAR and integrin interactions, was 
reported not only to lack in vitro transduction of various cell types but 
also to lack in vivo transduction of liver cells. Specifically, the doubly 

15 ablated adenovirus was reported to have a 700-fold reduction in liver 
transduction when compared to the non-ablated adenovirus. These 
results, however, were not reproduced by others. 

For many applications, the most clinically useful adenoviral vector 
would be deliverable systemically, such as into a peripheral vein, and 

20 would be targeted to a desired location in the body or a desired cell type, 
and would not have undesirable side effects resulting from targeting to 
other locations. In vivo adenoviral vector targeting is a major goal in gene 
therapy and a significant effort has been focused on developing strategies 
to achieve this goal. Successful targeting strategies would direct the 

25 entire vector dose to the appropriate site and would be likely to improve 
the safety profile of the vector by permitting the use of lower, less toxic 
vector doses, which potentially also can be less immunogenic. Thus, 
there is a need to develop adenoviruses that are fully detargeted in vivo 
for use as a base vector for producing redirected adenoviruses. 
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Therefore, among the objects herein, it is an object to provide 
detargeted adenoviral vectors, methods for preparation thereof, and uses 
thereof. Furthermore, it is in an object herein to provide 
immunotherapeutic methods and compositions therefor. 
5 SUMMARY 

Provided are immunotherapeutic methods and compositions that 
have immunotherapeutic activity. In particular, adenoviral vectors that 
deliver antigens to dendritic cells for processing and presentation to T 
cells are provided. Delivery of antigens to dendritic cells has preventive, 

10 diagnostic and therapeutic applications. 

Detargeted and fully detargeted adenoviral particles from serotype 
C, such as adenovirus 2 and adenovirus 5, adenovirus vectors from which 
such particles are produced, methods for preparation of the vectors and 
particles and uses of the vectors and particles are provided. Retargeted 

15 particles also are provided. 

The particles are detargarted from binding to certain native 
receptors (e.g., coxsackie-adenovirus receptor (CAR) for Ad5 and Ad2), 
and can be targeted to receptors expressed on dendritic cells. In addition, 
among the viral particles provides are particles that do not bind to or 

20 exhibit reduced binding to HSP (Heparin Sulfate Proteoglycans; also 

referred to as heparin sulfate glycosaminoglycans), and, hence, exhibit 
reduced or no binding to hepatocytes, which express HSPs. 

Provided are the adenoviral particles and genomes encoding such 
particles and/or genomes (viral nucleic acid molecules), cell lines and 

25 methods for producing such particls. In particular genomes that encode 
Ad5 particles or other type C viral particles that express fibers from 
adenovirus subgroup D or subgroup B, such as Ad19p, Ad30, Ad37, 
Ad16 and Ad35 (or that express modified fibers thereof) are provided. 
The fibers are modified to permit incorporation into the particle. The viral 
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particles provided herein exhibit reduced binding to hepatocytes and 
hence reduced liver toxicity. 

Adenoviral particles that contain a heterologous fiber or a portion 
thereof, whereby binding of the viral particle to dendritic cells is increased 
5 and binding to heparin sulfate proteoglycans (HSP) and to CAR is reduced 
or eliminated compared to a particle that expresses its native fiber are 
provided. In these particles, the adenoviral (Ad) particle, except for the 
fiber, is from a subgroup C adenovirus; and the fiber is from an adeno- 
virus subgroup D, such as Ad19p. In another embodiment, the 

10 heterologous fiber is from Ad30. In other embodiments, the fiber is chi- 
meric and comprises an N-terminal portion from a fiber of a subgroup C 
Ad virus; and the N-terminal portion is sufficient to increase incorporation 
into the particle compared to in its absence. For example, the fiber can 
be from an adenovirus Ad19p, Ad30, Ad37, Ad16 or Ad35 virus. The fi- 

15 ber protein can additionally include one or more further modifications that 
reduce or eliminate interaction of the resulting fiber with one or more cell 
surface proteins, such as CAR, in addition to HSP. 

The adenoviral particle also can include a mutation in the CAR- 
binding region of the capsid and/or a mutation in the a v integrin-binding 

20 region of the capsid, whereby binding to the integrin is eliminated or 

reduced. The CAR-binding region of the capsid that is modified can be on 
a fiber knob. 

In some embodiments, the chimeric fiber contains at least a 
sufficient number of amino acids of Ad19p fiber set forth as SEQ ID No. 
25 34 to target a particle to dendritic cells, and optionally to reduce or 

eliminate binding of the particle to HSP. For example, the Ad19p fiber is 
modified by replacing at least the N-terminal 15, 16 or 17 amino acids 
with the 15, 16 or 17 amino acids of an Ad2 or Ad5 fiber. In other 
embodiments, the chimeric fiber contains at least a sufficient number of 
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amino acids of Ad30 fiber set forth as SEQ ID No. 36 to target a particle 
to dendritic cells, and optionally to reduce or eliminate binding of the 
particle to HSP. For example, the Ad30 fiber is modified by replacing at 
least the N-terminal 1 5, 1 6 or 1 7 amino acids with the 15, 1 6 or 17 
5 amino acids of an Ad2 or Ad5 fiber. 

Hence, also provided are methods for making and using the adeno- 
viral particles that express the modified fibers and combinations of 
modified fibers and modified penton. With the fiber shaft modifications, 
particularly in combination with the fiber knob modifications and the 

10 penton modifications, the adenovirus particles are ablated for binding to 
their natural cellular receptor(s), i.e., they are detargeted. In addition, by 
selection of a subgroup D fiber, the resulting particles are targeted to 
dendritic cells. The particles also can be "retargeted" to a specific cell 
type through the addition of a ligand to the virus capsid, which causes 

15 the virus to bind to and infect such cell. The ligand can be added, for 
example, through genetic modification of a capsid protein gene. 

The nucleic acids, proteins, adenoviral particles and adenoviral 
vectors have a variety of uses. These include in vivo and in vitro uses to 
target nucleic acid to particular cells and tissues, for therapeutic 

20 purposes, including gene therapy, and also for the identification and study 
of cell surface receptors and identification of modes of interaction of 
viruses with cells. 

Nucleic acids encoding the capsid proteins, including the fibers are 
also provided. The nucleic acids can be provided as vectors, particularly 

25 as adenovirus vectors. Many adenoviral vectors are known and can be 
modified as needed in accord with the description herein. Adenoviral 
vectors include, but are not limited to, early generation adenoviral 
vectors, such as E1 -deleted vectors, gutless adenoviral vectors and 
replication-conditional adenoviral vectors, such as oncolytic adenoviral 
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vectors. The adenovirus vectors also can include heterologous nucleic 
acids that encode or provide products, such as tumor antigens and 
antigens from pathogens that induce an immunotherapeutic response 
whereby infection with such pathogen is prevented or the symptoms of 
5 infection reduced. Heterologous nucleic acid can encode a polypeptide 
or comprise or encode a regulatory sequence, such as a promoter or an 
RNA, including RNAi, small RNAs, other double-stranded RNAs, antisense 
RNA, and ribozymes. Promoters include, for example, constitutive and 
regulated promoters and tissue specific promoters, including tumor 

10 specific promoters. The promoter can be operably linked, for example, to 
a gene of an adenovirus essential for replication. 

Cells containing the nucleic acid molecules and cells containing the 
vectors are also provided. Such cells include packaging cells. The cells 
can be prokaryotic or eukaryotic cells, including mammalian cells, such as 

15 primate cells, including human cells. 

Also provided are adenoviral particles that contain the modified 
capsid proteins provided herein. The particles have increased tropism for 
dendritic cells, and also exhibit altered interaction or binding with HSP 
compared to particles that do not contain the modified capsid proteins. In 

20 addition to altered binding to HSP and dendritic cells, the particles can 
include further modifications, such as capsid proteins with altered 
interaction with other receptors as described above. In particular, the 
particles can have altered, typically reduced or eliminated, interaction with 
CAR, a v integrin and/or other receptors. The mutations include mutations 

25 in the fiber knob, penton and hexon. Exemplary fiber knob mutations are 
mutations in the AB loop or CD loop, such as K01 or K012. Such 
mutations include, for example, PD1, KOI, K012 and S* (see, e.g., U.S. 
provisional application Serial No. 60/459,000, and copending U.S. 
application Serial No. 10/351,890). In addition, the particles can include 
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additional ligands for retargeting to selected receptors. The adenoviral 
particles can be from any serotype and subgroup. 

Methods for expressing heterologous nucleic acids in a cell are 
provided. In these methods an adenoviral vector provided herein is 
5 transduced into a cell to deliver the nucleic acid and/or encoded products. 
Transduction can be effected in vivo or in vitro or ex vivo r and can be for 
a variety of purposes including study of gene expression and genetic 
therapy. The cells can be prokaryotic cells, but typically are eukaryotic 
cells, including mammalian cells, such as primate, including human cells. 
10 The cells can be of a specific type, such as a tumor cell or a cell in a 
particular tissue. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a plasmid map for pSKOI . 
Figure 2 is a plasmid map for pNDSQ3.1 K01 . 
15 Figures 3A-3C are plasmid maps of pAdmireRSVnBg (Fig. 3A), 

pSQ1 (Fig. 3B) and pSQ1K012 (Fig. 3C). 

Figure 4 is a plasmid map for pSQ1 PD1 . 

Figures 5A-5B are plasmid maps of pSQ1FK01PD1 (Fig. 5A) and 
pSQ1K012PD1 (Fig. 5B). 

20 Figure 6 shows in vitro transduction efficiency of A549 cells using 

adenoviral vectors containing fiber AB loop knob and/or penton, PD1 
mutations. The following adenoviral vectors were used in these studies: 
AvInBg, AvInBgFKOI, referred to as FKOI, Av1nBgPD1, referred to as 
PD1, and Av1 nBgFKOI PD1 that is referred to as FKOIPDI. 

25 Figure 7A-7B shows in vivo adenoviral-mediated liver gene 

expression (Fig. 7A) and hexon DNA content (Fig. 7B) using adenoviral 
vectors containing fiber AB loop knob and/or penton, PD1 mutations. The 
following adenoviral vectors were used in these studies: AvInBg, 
Av1 nBgFKOI, referred to as FKOI, Av1nBgPD1, referred to as PD1, 
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Av1 nBgFKOI PD1 , referred to as FK01PD1, Av1nBgK012, referred to as 
K012, and Av1 nBgKOI 2PD1 that is referred to as K012PD1. 

Figure 8 is a plasmid map for pFBshuttle(EcoRI). 

Figure 9 is a plasmid map for pSQIHSP. 
5 Figure 10 is a plasmid map for pSQIHSPKOL 

Figure 1 1 is a plasmid map for pSQ1 HSPPD1 . 

Figure 1 2 is a plasmid map for pSQ1 HSPK01 PD1 . 

Figures 13A-13C show the transduction efficiency of A549 and 
HeLa cells using adenoviral vectors containing fiber shaft, knob and/or 
10 penton mutations. Fig. 1 3A shows the dose response for the 

transduction efficiency of A549 cells. Fig. 13B shows the transduction 
efficiency of HeLa cells at 2000 ppc. Figure 13C shows the competition 
analysis of adenoviral vectors containing fiber shaft mutations. 

Figures 14A-14B shows the influence of fiber shaft mutations on in 
15 vivo adenoviral-mediated liver gene expression (Fig. 14A) and hexon DNA 
content (Fig. 14B). 

Figures 15A-15B are plasmid maps of pSQIHSPRGD (Fig. 1 5A) and 
pSQ1HSPK01RGD (Fig. 15B). 

Figure 16 shows that insertion of a RGD targeting ligand can 
20 restore transduction of the vectors containing the HSP binding shaft S* 
mutation. 

Figures 17A-17B are plasmid maps of pSQ1 AD35Fiber (Fig. 17A) 
and pSQ1 Ad35FcRGD (Fig. 17B). 

Figures 18A-18B are maps of plasmids encoding 35F chimeric 
25 fibers. Fig. 18A is a plasmid map of pSQ135T5H, and Fig. 18B is a 
plasmid map of pSQ15T35H. 

Figure 19 shows the results of an in vitro analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 
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Figure 20 shows the results of an in vivo analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 

Figures 21A-21B are plasmid maps of pSQ1Ad41sF (Fig. 21 A) and 
pSQ1 Ad41sFRGD (Fig. 21 B). 
5 Figure 22 shows the results of an in vivo analysis of Ad5 vectors 

containing Ad41 short fiber. 

Figure 23 shows the in vitro analysis of Ad5 based vectors 
containing the Ad41 short fiber which has been re-engineered to contain 
a cRGD ligand in the HI loop. 
10 Figure 24 shows enhanced transduction of AE1-2a cells with the 

Av3nBgFK01 detargeted adenoviral vector using hexadimethrine bromide 
(HB), protamine sulfate (PS) and poly-Iysine-RGD (K14) or the 
anti-penton-TNFa bifunctional protein (apen-TNF). 

Figure 25 shows ablation of HSP interaction decreases 
15 adenoviral-mediated gene transfer to other organs. 

Figure 26 shows in vivo liver transduction with adenoviral vectors 
which encode for /?-galactosidase and contain various mutations to the 
fiber and/or penton proteins. Results are plotted as percent transduction 
as compared to wild type. Two different methods for determining the 
20 level of transduction are shown for each vector. 

Figure 27 shows the adenoviral vector biodistribution to the liver 
and tumor for the vectors containing the S*, KOIS*, and 41 sF fibers. 
DETAILED DESCRIPTION 

A. DEFINITIONS 

25 B. Adenovirus-cell interactions 

1 . Fiber protein 

2. Pseudotyping 
C. Dendritic cell targeting 

1 . Dendritic cells 
30 2. Dendritic cell therapies 

3. Targeting adenoviral particles to dendritic cells 

a. Fiber substitution 

b. Efficient targeting 
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4. Additional modifications 

D. Adenovirus vector detargating 

E. Nucleic acids, adenoviral vectors and cells containing the nucleic acids 
and cells containing the vectors 

5 1 . Preparation of viral particles 

2. Adenoviral vectors and particles 

a. Gutless vectors 

b. Oncolytic vectors 

3. Packaging 

10 4. Propagation and scale-up 

F. Adenovirus expression vector systems 

1 . Nucleic acid gene expression cassettes 

2. Promoters 

G. Heterologous polynucleotides and therapeutic nucleic acids 
15 H. Formulation and administration 

1 . Formulation 

2. Administration 

I. Diseases, disorders and therapeutic products 

J. Examples 

20 A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used 
herein have the same meaning as is commonly understood by one of skill 
in the art to which the invention(s) belong. All patents, patent 
applications, published applications and publications, Genbank sequences, 

25 websites and other published materials referred to throughout the entire 
disclosure herein, unless noted otherwise, are incorporated by reference 
in their entirety. In the event that there are a plurality of definitions for 
terms herein, those in this section prevail. Where reference is made to a 
URL or other such identifier or address, it understood that such identifiers 

30 can change and particular information on the internet can come and go, 
but equivalent information is known and can be readily accessed, such as 
by searching the internet and/or appropriate databases. Reference thereto 
evidences the availability and public dissemination of such information. 
As used herein, the term "adenovirus" or "adenoviral particle" is 

35 used to include any and all viruses that can be categorized as an 

adenovirus, including any adenovirus that infects a human or an animal, 
including all groups, subgroups, and serotypes. Depending upon the 
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context reference to "adenovirus" can include adenoviral vectors. There 
are at least 51 serotypes of adenovirus that are classified into several 
subgroups. For example, subgroup A includes adenovirus serotypes 12, 
18, and 31. Subgroup C includes adenovirus serotypes 1, 2, 5, and 6. 
5 Subgroup D includes adenovirus serotypes 8, 9, 10, 13, 15, 17, 19, 19p, 
20, 22-30, 32, 33, 36-39, and 42-49. Subgroup E includes adenovirus 
serotype 4. Subgroup F includes adenovirus serotypes 40 and 41 . These 
latter two serotypes have a long and a short fiber protein. Thus, as 
used herein an adenovirus or adenovirus particle is a packaged vector or 
10 genome. 

As used herein, "virus," "viral particle," "vector particle," "viral 
vector particle," and "virion" are used interchangeably to refer to 
infectious viral particles that are formed when, such as when a vector 
containing all or a part of a viral genome, is transduced into an 

15 appropriate cell or cell line for the generation of such particles. The 

resulting viral particles have a variety of uses, including, but not limited 
to, transferring nucleic acids into cells either in vitro or in vivo. For 
purposes herein, the viruses are adenoviruses, including recombinant 
adenoviruses formed when an adenovirus vector, such as any provided 

20 herein, is encapsulated in an adenovirus capsid. Thus, a viral particle is a 
packaged viral genome. An adenovirus viral particle is the minimal 
structural or functional unit of a virus. A virus can refer to a single 
particle, a stock of particles or a viral genome. The adenovirus (Ad) 
particle is relatively complex and may be resolved into various 

25 substructures. 

Included among adenoviruses and adenoviral particles are any and 
all viruses that can be categorized as an adenovirus, including any 
adenovirus that infects a human or an animal, including all groups, 
subgroups, and serotypes. Thus, as used herein, "adenovirus" and 
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"adenovirus particle" refer to the virus itself and derivatives thereof and 
cover all serotypes and subtypes and naturally occurring and recombinant 
forms, except where indicated otherwise. Included are adenoviruses that 
infect human cells. Adenoviruses can be wildtype or can be modified in 
5 various ways known in the art or as disclosed herein. Such modifications 
include, but are not limited to, modifications to the adenovirus genome 
that is packaged in the particle in order to make an infectious virus. 
Exemplary modifications include deletions known in the art, such as 
deletions in one or more of the E1a, E1b, E2a, E2b, E3, or E4 coding 

10 regions. Other exemplary modifications include deletions of all of the 

coding regions of the adenoviral genome. Such adenoviruses are known 
as "gutless" adenoviruses. The terms also include replication-conditional 
adenoviruses, which are viruses that preferentially replicate in certain 
types of cells or tissues but to a lesser degree or not at all in other types. 

15 For example, among the adenoviral particles provided herein, are 

adenoviral particles that replicate in abnormally proliferating tissue, such 
as solid tumors and other neoplasms. These include the viruses disclosed 
in U.S. Patent No. 5,998,205 and U.S. Patent No. 5,801,029. Such 
viruses are sometimes referred to as "cytolytic" or "cytopathic" viruses 

20 (or vectors), and if they have such an effect on neoplastic cells, are 
referred to as "oncolytic" viruses (or vectors). 

As used herein, the terms "vector," "polynucleotide vector," 
"polynucleotide vector construct," "nucleic acid vector construct," and 
"vector construct" are used interchangeably herein to mean any nucleic 

25 acid construct that can be used for gene transfer, as understood by those 
skilled in the art. 

As used herein, the term "viral vector" is used according to its 
art-recognized meaning. It refers to a nucleic acid vector construct that 
includes at least one element of viral origin and can be packaged into a 
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viral vector particle. The viral vector particles can be used for the 
purpose of transferring DNA, RNA or other nucleic acids into cells either 
in vitro or in vivo. Viral vectors include, but are not limited to, retroviral 
vectors, vaccinia vectors, lentiviral vectors, herpes virus vectors (e.g., 
5 HSV), baculoviral vectors, cytomegalovirus (CMV) vectors, papillomavirus 
vectors, simian virus (SV40) vectors, Sindbis vectors, Semliki Forest virus 
vectors, phage vectors, adenoviral vectors, and adeno-associated viral 
(AAV) vectors. Suitable viral vectors are described, for example, in U.S. 
Patent Nos. 6,057,155, 5,543,328 and 5,756,086. The vectors provided 

10 herein are adenoviral vectors. 

As used herein, "adenovirus vector" and "adenoviral vector" are 
used interchangeably and are well understood in the art to mean a 
polynucleotide containing all or a portion of an adenovirus genome. An 
adenoviral vector, refers to nucleic encoding a complete genome or a 

15 modified genome or one that can be used to introduce heterologous 

nucleic acid when transferred into a cell, particularly when packaged as a 
particle. An adenoviral vector can be in any of several forms, including, 
but not limited to, naked DNA, DNA encapsulated in an adenovirus 
capsid, DNA packaged in another viral or viral-like form (such as herpes 

20 simplex, and AAV), DNA encapsulated in liposomes, DNA complexed 
with polylysine, complexed with synthetic polycationic molecules, 
conjugated with transferrin, complexed with compounds such as PEG to 
immunologically "mask" the molecule and/or increase half-life, or 
conjugated to a non-viral protein. 

25 As used herein, oncolytic adenoviruses refer to adenoviruses that 

replicate selectively in tumor cells 

As used herein, a variety of vectors with different requirements and 
purposes are described. For example, one vector is used to deliver 
particular nucleic acid molecules into a packaging cell line for stable 
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integration into a chromosome. These types of vectors also are referred 
to as complementing plasmids. A further type of vector carries or delivers 
nucleic acid molecules in or into a cell line (e.g., a packaging cell line) for 
the purpose of propagating viral vectors; hence, these vectors also can be 
5 referred to herein as delivery plasmids. A third "type" of vector is the 
vector that is in the form of a virus particle encapsulating a viral nucleic 
acid and that is comprised of the capsid modified as provided herein. 
Such vectors also can contain heterologous nucleic acid molecules 
encoding particular polypeptides, such as therapeutic polypeptides or 

10 regulatory proteins or regulatory sequences to specific cells or cell types 
in a subject in need of treatment. 

As used herein, the term "motif" is used to refer to any set of 
amino acids forming part of a primary sequence of a protein, either 
contiguous or capable of being aligned to certain positions that are 

15 invariant or conserved, that is associated with a particular function. The 
motif can occur not only by virtue of the primary sequence, but also as a 
consequence of three-dimensional folding. For example, the adenovirus 
fiber is a trimer, hence the trimeric structure can contribute to formation 
of a motif. Alternatively, a motif can be considered as a domain of a 

20 protein, where domain is a region of a protein molecule delimited on the 
basis of function without knowledge of and relation to the molecular 
substructure, as, e.g., the part of a protein molecule that binds to a 
receptor. As shown herein, the motif KKTK constitutes a consensus 
sequence for fiber shaft interaction with HSP. 

25 As used herein, cell therapy is a method of treatment involving the 

administration of live cells. Adoptive immunotherapy is a treatment 
process involving removal of cells from a subject, the processing of the 
cells in some manner ex-vivo and the infusion of the processed cells into 
the same or different subject as a therapy. 
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As used herein, a cell therapeutic refers to the compositions of 
cells that are formulated as a drug whose active ingredient is wholly or in 
part a living cell, 

As used herein, immune cells are the subset of blood cells known 
5 as white blood cells, which include mononuclear cells such as 
lymphocytes, monocytes, macrophages and granulocytes. 

As used herein, T-cells are lymphocytes that express the CD3 
antigen. 

As used herein, helper cells are CD4 4- lymphocytes. 

10 As used herein, regulatory cells are a subset of T-cells, most 

commonly CD4+ T-cells, that are capable of enhancing or suppressing an 
immune response. Regulatory immune cells regulate an immune response 
primarily by virtue of their cytokine secretion profile. Some regulatory 
immune cells also can act to enhance or suppress an immune response by 

15 virtue of antigens expressed on their cell surface and mediate their effects 
through cell-to-cell contact. Th1 and Th2 cells are examples of regulatory 
cells. 

As used herein, effector cells are immune cells that primarily act to 
eliminate tumors or pathogens through direct interaction, such as 
20 phagocytosis, perforin and/or granulozyme secretion, induction of 

apoptosis, etc. Effector cells generally require the support of regulatory 
cells to function and also act as the mediators of delayed type 
hypersensitivity reactions and cytotoxic functions. Examples of effector 
cells are B lymphocytes, macrophages, cytotoxic lymphocytes, LAK cells, 
25 NK cells and neutrophils. 

As used herein, a professional antigen presenting cells (APC) 
include dendritic cells, B-cells and macrophages. 

As used herein, the term "bind" or "binding" is used to refer to the 
binding between a ligand and its receptor, such as the binding of the Ad5 
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knob domain with CAR (coxsackie-adenovirus receptor), with a K d in the 
range of 10-2 to 10-15 mole/I, generally, 10" 6 to 10" 15 , 10" 7 to 10" 15 and 
typically 10" 8 to 10" 15 (and/or a K a of 10 5 -10 12 , 10 7 -10 12 , 10 8 -10 12 l/mole). 
As used herein, specific binding or selective binding means that the 
5 binding of a particular ligand and one receptor interaction (k a or K eq ) is at 
least 2-fold, generally, 5, 10, 50, 100 or more-fold, greater than for 
another receptor. A statement that a particular viral vector is targeted to 
a cell or tissue means that its affinity for such cell or tissue in a host or in 
vitro is at least about 2-fold, generally, 5, 10, 50, 100 or more-fold, 
10 greater than for other cells and tissues in the host or under the in vitro 
conditions. 

As used herein, the term "ablate" or "ablated" is used to refer to an 
adenovirus, adenoviral vector or adenoviral particle, in which the ability to 
bind to a particular cellular receptor is reduced or eliminated, generally 

15 substantially eliminated (i.e., reduced more than 10-fold, 100-fold or 
more) when compared to a corresponding wild-type adenovirus. An 
ablated adenovirus, adenoviral vector or adenoviral particle also is said to 
be detargeted, i.e., the modified adenovirus, adenoviral vector or 
adenoviral particle does not possess the native tropism of the wild-type 

20 adenovirus. The reduction or elimination of the ability of the mutated 
adenovirus fiber protein to bind a cellular receptor as compared to the 
corresponding wild-type fiber protein can be measured or assessed by 
comparing the transduction efficiency (gene transfer and expression of a 
marker gene) of an adenovirus particle containing the mutated fiber 

25 protein compared to an adenovirus particle containing the wild-type fiber 
protein for cells having the cellular receptor. 

As used herein, tropism with reference to an adenovirus refers to 
the selective infectivity or binding that is conferred on the particle by a 
capsid protein, such as the fiber protein and/or penton. 
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As used herein, "penton" or "penton complex" is used herein to 
designate a complex of penton base and fiber. The term "penton" can 
also be used to indicate penton base, as well as penton complex. The 
meaning of the term "penton" alone should be clear from the context 
5 within which it is used. 

As used herein, the term "substantially eliminated" refers to a 
transduction efficiency less than about 1 1 % of the efficiency of the 
wild-type fiber containing virus on HeLa cells. The transduction efficiency 
on Hela cells can be measured (see, e.g., Example 1 of U.S. Patent 

10 Application Serial No. 09/870,203 filed on May 30, 2001, and published 
as U.S. Published application No. 20020137213, and of International 
Patent Application No. PCT/EP01/06286 filed June 1, 2001, and 
published as WO 01/92299). Briefly, HeLa cells are infected with the 
adenoviral vectors containing mutated fiber proteins to evaluate the 

15 effects of fiber amino acid mutations on CAR interaction and subsequent 
gene expression. Monolayers of HeLa cells in 12 well dishes are infected 
with, for example, 1000 particles per cell for 2 hours at 37° C in a total 
volume of, for example, 0.35 ml of the DMEM containing 2% FBS. The 
infection medium is then aspirated from the monolayers and I ml of 

20 complete DMEM containing 10% FBS was added per well. The cells are 
incubated for an sufficient time, generally about 24 hours, to allow for J3- 
galactosidase expression, which is measured by a chemiluminescence 
reporter assay and by histochemical staining with a chromogenic 
substrate. The relative levels of /?-galactosidase activity are determined 

25 using as suitable system, such as the Galacto-Light chemiluminescence 
reporter assay system (Tropix, Bedford, Mass.). Cell monolayers are 
washed with PBS and processed according to the manufacturer's 
protocol. The cell homogenate is transferred to a microfuge tube and 
centrifuged to remove cellular debris. Total protein concentration is 
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determined, such as by using the bicinchoninic acid(BCA) protein assay 
(Pierce, Inc., Rockford, III.) with bovine serum albumin as the assay 
standard. An aliquot of each sample is then incubated with the Tropix 
/?-galactosidase substrate for 45 minutes in a 96 well plate. A 
5 luminometer is used determine the relative light units (RLU) emitted per 
sample and then normalized for the amount of total protein in each 
sample (RLU/ug total protein). For the histochemical staining procedure, 
the cell monolayers are fixed with 0.5% glutaraldehyde in PBS, and then 
were incubated with a mixture of 1 mg of 

10 5-bromo-4-chloro-3-indolyl-/?-D-gaIactoside (X-gal) per ml, 5 mM 

potassium ferrocyanide, 5 mM potassium ferricyanide and 2 mM MgCI 2 in 
0.5 ml of PBS. The monolayers are washed with PBS and the blue cells 
are visualized by light microscopy, such as with a Zeiss ID03 microscope. 
Generally, the efficiency is less than about 9%, and typically is less than 

15 about 8%. 

As used herein, the phrase "reduce" or "reduction" refers to a 
change in the efficiency of transduction by the adenovirus containing the 
mutated or heterologous fiber as compared to the adenovirus containing 
the wild-type fiber to a level of about 75% or less of the wild-type on 

20 HeLa cells. Generally, the change in efficiency is to a level of about 65% 
or less than wild-type. Typically it is about 55% or less. This system is 
able to rapidly analyze modified fiber proteins and/or modified penton 
proteins for desired tropism in the context of the viral particle. 

As used herein, the term "mutate" or "mutation" or similar terms 

25 refers to the deletion, insertion or change of at least one amino acid in the 
protein of interest (e.g. the part of the fiber shaft region interacting with 
HSP). The amino acid can be changed by substitution or by modification 
in a way that derivatizes the amino acid. 
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As used herein, the term "polynucleotide" means a nucleic acid 
molecule, such as DNA or RNA, that encodes a polynucleotide. The 
molecule can include regulatory sequences, and is generally DNA. Such 
polynucleotides are prepared or obtained by techniques known by those 
5 skilled in the art in combination with the teachings contained therein. 

As used herein, the term "viral vector" is used according to its art- 
recognized meaning. It refers to a nucleic acid vector construct that 
includes at least one element of viral origin and can be packaged into a 
viral vector particle. The viral vector particles can be used, for example, 

10 for transferring DNA into cells either in vitro or in vivo. 

As used herein, adenoviral genome is intended to include any 
adenoviral vector or any nucleic acid sequence comprising a modified 
fiber protein. All adenovirus serotypes are contemplated for use in the 
vectors and methods herein. 

15 As used herein, a packaging cell line is a cell line that is able to 

package adenoviral genomes or modified genomes to produce viral 
particles. It can provide a missing gene product or its equivalent. Thus, 
packaging cells can provide complementing functions for the genes 
deleted in an adenoviral genome (e.g., the nucleic acids encoding 

20 modified fiber proteins) and are able to package the adenoviral genomes 
into the adenovirus particle. The production of such particles require that 
the genome be replicated and that those proteins necessary for 
assembling an infectious virus are produced. The particles also can 
require certain proteins necessary for the maturation of the viral particle. 

25 Such proteins can be provided by the vector or by the packaging cell. 

As used herein, detargeted adenoviral particles have ablated 
(reduced or eliminated) interaction with receptors with which native 
particles. It is understood that in vivo no particles are fully ablated such 
that they do not interact with any cells. Detargeted particles have 
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reduced, typically substantially reduced, or eliminated interaction with 
native receptors. For purposes herein, detargeted particles have reduced 
(2-fold, 5-fold, 10-fold, 100-fold or more) binding or virtually no binding 
to CAR or another native receptor. The particles still bind to cells, but the 
5 types of cells and interactions are reduced. 

As used herein, pseudotyping describes the production of 
adenoviral vectors having modified capsid protein or capsid proteins from 
a different serotype from the serotype of the vector itself. One example, 
is the production of an adenovirus 5 vector particle containing an Ad37 or 
10 Ad35 fiber protein. This can be accomplished by producing the 
adenoviral vector in packaging cell lines expressing different fiber 
proteins. 

As used herein, receptor refers to a biologically active molecule 
that specifically or selectively binds to (or with) other molecules. The 

15 term "receptor protein" can be used to more specifically indicate the 
proteinaceous nature of a specific receptor. 

As used herein, the term "heterologous polynucleotide" means a 
polynucleotide derived from a biological source other than an adenovirus 
or from an adenovirus of a different strain or can be a polynucleotide that 

20 is in a different locus from wild-type virus. The heterologous 

polynucleotide can encode a polypeptide, such as a toxin or a therapeutic 
protein. The heterologous polynucleotide can contain regulatory regions, 
such as a promoter regions, such as a promoter active in specific cells or 
tissue, for example, tumor tissue as found in oncolytic adenoviruses. 

25 Alternatively, the heterologous polynucleotide can encode a polypeptide 
and further contain a promoter region operably linked to the coding 
region. 
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As used herein, the term "cyclic RGD" (or cRGD) refers to any 
amino acid that binds to a v integrins on the surface of cells and contains 
the sequence RGD (Arg-Gly-Asp). 

As used herein, the KO mutations refer to mutations in fiber that 
5 knock out binding to CAR. For example, a K01 mutation refers to a 
mutation in the Ad5 fiber and corresponding mutations in other fiber 
proteins. In Ad5, this mutation results in a substitution of fiber amino 
acids 408 and 409, changing them from serine and proline to glutamic 
acid and alanine, respectively. As used herein, a K012 mutation refers to 

10 a mutation in the Ad5 fiber and corresponding mutations in other fiber 
proteins. In Ad5, this mutation is a four amino acid substitution as 
follows: R512S, A515G, E516G, and K517G. Other KO mutations can 
be identified empirically or are known to those of skill in the art. 

As used herein, PD mutations refer to mutations in the penton gene 

15 that ablate binding by the encoded to a y integrin by replacing the RGD 
tripeptide. The PD1 mutation exemplified herein results in a substitution 
of amino acids 337 through 344 of the Ad5 penton protein, HAIRGDTF 
(SEQ ID NO. 49), with amino acids SRGYPYDVPDYAGTS (SEQ ID NO. 
50), thereby replacing the RGD tripeptide. 

20 As used herein, reference to an amino acid in an adenovirus protein 

or to a nucleotide in an adenovirus genome is with reference to Ad5, 
unless specified otherwise. Corresponding amino acids and nucleotides in 
other adenovirus strains and modified strains and in vectors can be 
identified by those of skill in the art. Thus, recitation of a mutation is 

25 intended to encompass all adenovirus strains that possess a 
corresponding locus. 

As used herein, tumor antigen refers to a cell surface protein 
expressed or located on the surface of tumor cells. 
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As used herein, treatment means any manner in which the 
symptoms of a condition, disorder or disease are ameliorated or otherwise 
beneficially altered. 

As used herein, a therapeutically effective product is a product that 
5 is encoded by heterologous DNA that, upon introduction of the DNA into 
a host, a product is expressed that effectively ameliorates or eliminates 
the symptoms, manifestations of an inherited or acquired disease or that 
cures said disease. 

As used herein, a subject is an animal, such as a mammal, typically 

10 a human, including patients. 

As used herein, genetic therapy involves the transfer of 
heterologous DNA to the certain cells, target cells, of a mammal, 
particularly a human, with a disorder or conditions for which such therapy 
is sought. The DNA is introduced into the selected target cells in a 

15 manner such that the heterologous DNA is expressed and a therapeutic 
product encoded thereby is produced. Alternatively, the heterologous 
DNA may in some manner mediate expression of DNA that encodes the 
therapeutic product, it may encode a product, such as a peptide or RNA 
that in some manner mediates, directly or indirectly, expression of a 

20 therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product to replace a defective gene or supplement a 
gene product produced by the mammal or the cell in which it is 
introduced. The introduced nucleic acid may encode a therapeutic 
compound, such as a growth factor inhibitor thereof, or a tumor necrosis 

25 factor or inhibitor thereof, such as a receptor therefor, that is not normally 
produced in the mammalian host or that is not produced in therapeutically 
effective amounts or at a therapeutically useful time. The heterologous 
DNA encoding the therapeutic product may be modified prior to 
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introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, a therapeutic nucleic acid is a nucleic acid that 
encodes a therapeutic product. The product can be nucleic acid, such as 
5 a regulatory sequence or gene, or can encode a protein that has a 

therapeutic activity or effect. For example, therapeutic nucleic acid can 
be a ribozyme, antisense, double-stranded RNA, a nucleic acid encoding a 
protein and others. 

As used herein, "homologous" means about greater than 25% 

10 nucleic acid sequence identity, such as 25%, 40%, 60%, 70%, 80%, 
90% or 95%. If necessary the percentage homology will be specified. 
The terms "homology" and "identity" are often used interchangeably. In 
general, sequences are aligned so that the highest order match is 
obtained (see, e.g.: Computational Molecular Biology, Lesk, A.M., ed., 

15 Oxford University Press, New York, 1988; Biocomputing: Informatics and 
Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, 
H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in 
Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence 

20 Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, 
New York, 1991; Carillo et aL (1988) SIAM J Applied Math 48: 1 073) . 
By sequence identity, the number of conserved amino acids are 
determined by standard alignment algorithms programs, and are used with 
default gap penalties established by each supplier. Substantially 

25 homologous nucleic acid molecules would hybridize typically at moderate 
stringency or at high stringency all along the length of the nucleic acid or 
along at least about 70%, 80% or 90% of the full-length nucleic acid 
molecule of interest. Also contemplated are nucleic acid molecules that 
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contain degenerate codons in place of codons in the hybridizing nucleic 
acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences 
that are at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% 
5 or 99% "identical" can be determined using known computer algorithms 
such as the "FAST A" program, using for example, the default parameters 
as in Pearson eta/. (1988) Proc. Natl. Acad. Sci. USA 85:2444 (other 
programs include the GCG program package (Devereux, J., et aL, Nucleic 
Acids Research 12f/):387 (1984)), BLASTP, BLASTN, FASTA (Atschul, 

10 S.F., etal. f J Molec Biol 2/5:403 (1990); Guide to Huge Computers, 

Martin J. Bishop, ed., Academic Press, San Diego, 1994, and Carillo et aL 
( 1 988) SI AM J Applied Math 48: 1 073) . For example, the BLAST function 
of the National Center for Biotechnology Information database can be 
used to determine identity. Other commercially or publicly available 

15 programs include, DNAStar "MegAlign" program (Madison, Wl) and the 
University of Wisconsin Genetics Computer Group (UWG) "Gap" program 
(Madison Wl)). Percent homology or identity of proteins and/or nucleic 
acid molecules can be determined, for example, by comparing sequence 
information using a GAP computer program (e.g., Needleman et aL 

20 (1970) J. Mol. Biol. 48:443, as revised by Smith and Waterman ((1981) 
Adv. Appl. Math. 2:482). Briefly, the GAP program defines similarity as 
the number of aligned symbols (i.e., nucleotides or amino acids) which 
are similar, divided by the total number of symbols in the shorter of the 
two sequences. Default parameters for the GAP program can include: (1) 

25 a unary comparison matrix (containing a value of 1 for identities and 0 for 
non-identities) and the weighted comparison matrix of Gribskov et aL 
(1986) Nucl. Acids Res. 14:6745, as described by Schwartz and Dayhoff, 
eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 
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3.0 for each gap and an additional 0.10 penalty for each symbol in each 
gap; and (3) no penalty for end gaps. Therefore, as used herein, the term 
"identity" represents a comparison between a test and a reference 
polypeptide or polynucleotide. 
5 As used herein, the term "at least 90% identical to" refers to 

percent identities from 90 to 99.99 relative to the reference polypeptides. 
Identity at a level of 90% or more is indicative of the fact that, assuming 
for exemplification purposes a test and reference polynucleotide length of 
100 amino acids are compared, no more than 10% (i.e., 10 out of 100) 

10 of amino acids in the test polypeptide differs from that of the reference 
polypeptides. Similar comparisons can be made between a test and 
reference polynucleotides. Such differences can be represented as point 
mutations randomly distributed over the entire length of an amino acid 
sequence or they can be clustered in one or more locations of varying 

15 length up to the maximum allowable, e.g. 10/100 amino acid difference 
(approximately 90% identity). Differences are defined as nucleic acid or 
amino acid substitutions, or deletions. At the level of homologies or 
identities above about 85-90%, the result should be independent of the 
program and gap parameters set; such high levels of identity can be 

20 assessed readily, often without relying on software. 

As used herein: stringency of hybridization in determining 
percentage mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1% SDS, 65°C 

2) medium stringency: 0.2 x SSPE, 0.1 % SDS, 50°C 
25 3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50°C 

Those of skill in this art know that the washing step selects for 
stable hybrids and also know the ingredients of SSPE (see, e.g., 
Sambrook, E.F. Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory Press (1989), vol. 3, p. B.13, see, 
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also, numerous catalogs that describe commonly used laboratory 
solutions). SSPE is pH 7.4 phosphate- buffered 0.18 M NaCI. Further, 
those of skill in the art recognize that the stability of hybrids is determined 
by T m , which is a function of the sodium ion concentration and 
5 temperature (T m = 81 .5° C-1 6.6(Iog 10 [Na + ]) + 0.41 (%G + 0-600/1)), so 
that the only parameters in the wash conditions critical to hybrid stability 
are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 

10 limitation, procedures using conditions of low stringency are as follows 
(see also Shilo and Weinberg, Proc. Natl. Acad. Sci. USA 75:6789-6792 
(1981)): Filters containing DNA are pretreated for 6 hours at 40°C in a 
solution containing 35% formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 
5 mM EDTA, 0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 //g/ml denatured 

15 salmon sperm DNA (10X SSC is 1.5 M sodium chloride, and 0.15 M 
sodium citrate, adjusted to a pH of 7). 

Hybridizations are carried out in the same solution with the 
following modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 
/yg/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 

20 cpm 32 P-Iabeled probe is used. Filters are incubated in hybridization 

mixture for 18-20 hours at 40°C, and then washed for 1.5 hours at 55°C 
in a solution containing 2X SSC, 25 mM Tris-HCI (pH 7.4), 5 mM EDTA, 
and 0.1% SDS. The wash solution is replaced with fresh solution and 
incubated an additional 1 .5 hours at 60°C. Filters are blotted dry and 

25 exposed for autoradiography. If necessary, filters are washed for a third 
time at 65-68°C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art {e.g., as 
employed for cross-species hybridizations). 
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By way of example and not way of limitation, procedures using 
conditions of moderate stringency include, for example, but are not 
limited to, procedures using such conditions of moderate stringency are 
as follows: Filters containing DNA are pretreated for 6 hours at 55 °C in a 
5 solution containing 6X SSC, 5X Denhart's solution, 0.5% SDS and 100 
//g/ml denatured salmon sperm DNA. Hybridizations are carried out in the 
same solution and 5-20 X 10 6 cpm 32 P-labeled probe is used. Filters are 
incubated in hybridization mixture for 18-20 hours at 55°C, and then 
washed twice for 30 minutes at 60°C in a solution containing 1X SSC 

10 and 0.1% SDS. Filters are blotted dry and exposed for autoradiography. 
Other conditions of moderate stringency which can be used are well- 
known in the art. Washing of filters is done at 37°C for 1 hour in a 
solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using 

15 conditions of high stringency are as follows: Prehybridization of filters 
containing DNA is carried out for 8 hours to overnight at 65 °C in buffer 
composed of 6X SSC, 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% 
PVP, 0.02% Ficoll, 0.02% BSA, and 500 /jg/m\ denatured salmon sperm 
DNA. Filters are hybridized for 48 hours at 65°C in prehybridization 

20 mixture containing 100 /vg/ml denatured salmon sperm DNA and 5-20 X 
10 6 cpm of 32 P-labeled probe. Washing of filters is done at 37 °C for 
1 hour in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 
0.01% BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 
minutes before autoradiography. Other conditions of high stringency 

25 which can be used are well known in the art. 

The term substantially identical or substantially homologous or 
similar varies with the context as understood by those skilled in the 
relevant art and generally means at least 60% or 70%, preferably means 
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at least 80%, 85% or more preferably at least 90%, and most preferably 
at least 95% identity. 

As used herein, substantially identical to a product means 
sufficiently similar so that the property of interest is sufficiently 
5 unchanged so that the substantially identical product can be used in place 
of the product. 

As used herein, substantially pure means sufficiently homogeneous 
to appear free of readily detectable impurities as determined by standard 
methods of analysis, such as thin layer chromatography (TLC), gel 

10 electrophoresis and high performance liquid chromatography (HPLC), used 
by those of skill in the art to assess such purity, or sufficiently pure such 
that further purification would not detectably alter the physical and 
chemical properties, such as enzymatic and biological activities, of the 
substance. Methods for purification of the compounds to produce 

15 substantially chemically pure compounds are known to those of skill in 
the art. A substantially chemically pure compound can, however, be a 
mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 

The methods and preparation of products provided herein, unless 

20 otherwise indicated, employ conventional techniques of chemistry, 
molecular biology, microbiology, recombinant DNA, genetics, 
immunology, cell biology, cell culture and transgenic biology, which are 
within the skill of the art (see, e.g., Maniatis eta/. (1982) Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 

25 Spring Harbor, NY); Sambrook et al. (1989) Molecular Cloning: A 

Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY; Ausubel et al. (1992) Current Protocols in Molecular Biology, 
Wiley and Sons, New York; Glover (1985) DNA Cloning I and II, Oxford 
Press; Anand (1992) Techniques for the Analysis of Complex Genomes 
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(Academic Press); Guthrie and Fink (1991) Guide to Yeast Genetics and 
Molecular Biology, Academic Press; Harlow and Lane (1988) Antibodies: 
A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY; Jakoby and Pastan, eds. (1979) Cell Culture. Methods in Enzymology 
5 58, Academic Press, Inc., Harcourt Brace Jaovanovich, NY; Nucleic Acid 
Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal 
Cells (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And 
Enzymes (IRL Press, 1986); B. Perbal (1984), A Practical Guide To 
Molecular Cloning-, Gene Transfer Vectors For Mammalian Cells (J. H. 

10 Miller and M. P. Calos eds., 1987, Cold Spring Harbor Laboratory); 
Methods In Enzymology, Vols. 154 and 155 (Wu et al. eds.); 
Immunochemical Methods In Cell And Molecular Biology (Mayer and 
Walker, eds., Academic Press, London, 1987); Handbook Of Experimental 
Immunology, Volumes l-IV (D. M. Weir and C. C. Blackwell, eds., 1986); 

15 Hogan et aL (1986) Manipulating the Mouse Embryo, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. 
B. Adenovirus-cell interactions 

The ability of different subgroups of adenovirus to interact with, or 
not interact with, specific cell types and/or particular receptors can be 

20 exploited to produce adenoviruses with desired specificity. For example, 
adenovirus can be modified such that they are able to more efficiently 
target specific cell types and/or tissues. Adenovirus serotypes also can 
be modified to reduce or eliminate their interactions with a natural 
receptor and thereby reduce or eliminate the interaction of adenovirus 

25 with a particular cell type and/or tissue. Thus, provided herein are 

modifications of the viral capsid that alter the interaction of an adenovirus 
with its natural receptors and/or cell types and modifications that target 
an adenovirus to interact with other receptors and/or cell types. In 
particular, modifications that result targeting to dendritic cells are 
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provided. Also provided are modifications that result in reduction or 
ablation of the interaction of an adenovirus, particularly in vivo, with other 
cell types. 

Different adenovirus serotypes infect different cell types, largely 
5 because their fibers bind distinct receptors (Defer et al. (1990) J. Virol. 
54:3661-3673; Stevenson et al. (1995) J. Virol. 55:2850-2857; Arnberg 
et al. (2000) J. Virol. 74:42-48). For subgroup C viruses (including Ad2 
and Ad5), coxsackievirus and adenovirus receptor (CAR) serves as the 
cellular receptor (Tomko and Philipson (1997) Proc. Natl. Acad. Sci. 
10 U.S.A. 54:3352-3356; Bergelson et al. (1997) Science 275:1320-1323). 
While adenoviruses from several other subgroups also bind CAR (Roelvink 
et al. (1998) J. Virol. 72:7909-7915), infection and competition studies 
indicate that they use other proteins as primary receptors (Arnberg et al. 
(2000) J. Virol. 74:42-48; Huang et al. (1 999) J. Virol. 73:2798-2802; 
15 Segerman et al. (2000) J. Virol. 74:1457-1467; Wu et al. (2001) Virology 
275:78-89; Shayakhmetov et al. (2000) J. Virol. 74:2567-2584). 
1 . Fiber Protein 

The adenovirus fiber protein is a homotrimeric protein containing 
three polypeptides of 62 kDa. Ad fiber proteins are located at each of the 

20 twelve icosahedral vertices of the viral particle (Chroboczek et al. (1995) 
Curr. Top. Microbiol. Immunol. 755:163-200). The sequences of the 
fiber gene from a variety of serotypes including adenovirus serotypes 2 
(Ad2), Ad5, Ad3, Ad12, Ad35, Ad40, and Ad41 are known. There are at 
least 21 different fiber genes in Genbank. Sequence analysis of fiber 

25 proteins from several different adenovirus serotypes (Hong et al. (1988) 
Virology 767:545-553; Kidd et al. (1990) Virology 775:139-150; Signas 
et al. (1985) J. Virol. 53:672-678) and the crystal structure of Ad2 fiber 
(van Raaij et al. (1999) Nature 407:935-938) have identified three 
structural domains in the fiber. The N-terminal region of the fiber protein 
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interacts with the penton base proteins to anchor the fiber to the viral 
particle. The C-terminal knob region is responsible for mediating virus 
binding to host cells. These two regions are connected via a long, thin 
central shaft region, which contains a variable number of shaft repeats, 
5 each repeat being made up of 15 residues designated as a-o. The 
repeating domains of the fiber shaft are characterized by an invariant 
glycine or proline at position j and a conserved pattern of hydrophobic 
residues (van Raaij et al. (1999) Nature 407:935-938). A conserved 
stretch of amino acids which includes the sequence TLWT (SEQ ID No. 

10 46) marks the boundary between the repeating units of beta structure in 
the shaft and the globular head domain. The number of shaft repeats in 
Ad fiber depends on the adenoviral serotype. For example, Ad2 and Ad5 
fiber proteins include 22 shaft repeats, while Ad3 contains only 5 repeats 
(Chroboczek et al. (1995) Curr. Top. Microbiol. Immunol. 755:163-200). 

15 The C-terminal fiber knob mediates attachment to CAR, which is a 

46 kDa protein of the immunoglobulin superfamily that is found on many 
different cell types (Bergelson et al. (1997) Science 275:1 320-1 323). A 
crystal structure of the Ad 12 fiber in complex with CAR demonstrates 
that sequences in the fiber knob, specifically the AB loop, interact with 

20 the first Ig-like domain of CAR (Bewley et al. (1999) Science 256:1579- 
1583). Following attachment to CAR, binding of the Ad penton base 
protein to a v integrins enables internalization and penetration of the virus 
into the cell. 

Adenovirus interactions with specific cell types are also influenced 
25 by the capacity to bind HSP. As noted, adenoviruses having fiber shafts 
that do not interact with HSP include (a) adenoviruses of subgroup B, 
e.g., Ad3, Ad35, Ad7, Ad11, Ad16, Ad21, (b) adenoviruses of subgroup 
F, e.g., Ad40 and Ad41, specifically the short fiber, and (c) adenoviruses 
of subgroup D, which includes adenovirus serotype 8, 9, 10, 13, 15, 17, 
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19, 20, 22-30, 32, 33, 36-39, and 42-49. Serotype 19 has variants. 
Ad19p is a nonpathogenic variant of Ad 19 (Arnberg et al. (1998) Virology 
227:239-244) while Ad19a, along with Ad8 and Ad37, are major causes 
of EKC. Ad19a and Ad37 have identical fiber proteins (Arnberg et al. 
5 (1998) Virology 227:239-244) and have similar tropism in vivo. 
2, Pseudotyping 

Adenoviral vectors can be modified for targeting specific tissue 
and/or cell types through a variety of modifications, including 
modifications to the viral capsid, particularly to the fiber protein. 

10 Modifications provided herein include, but are not limited to, pseudotyping 
of the viral particle with heterologous and/or chimeric fiber protein. 

Fibers that use non-CAR receptors can direct infection of a variety 
of different cell types (Shayakhmetov et al. (2000) J. Virol. 74:2567- 
2584; Von Seggern et al. (2000) J. Virol. 74:354-362; Law and Davidson 

15 (2002) J. Virol. 75:656-661; Havenga et al. (2002) J. Virol. 76:4612- 
4620; Gall et al. (1996) J. Virol. 70:2116-2123; Chillon et al. (1999) J. 
Virol. 73:2537-2540), thus providing a means for adenovirus vector 
targeting. Adenovirus packaging cell systems allow generation of viral 
particles with essentially any desired fiber protein by trans- 

20 complementation of a fiber-deleted virus (Von Seggern et al. (2000) J. 
Virol. 74:354-362; Von Seggern et al. (1 999) J. Virol. 73:1601-1608). 
This technology, referred to as psuedotyping, allows generation of 
targeted viral particles that can be used to study tropism in vitro and in 
vivo, as well as permitting construction and propagation of viruses whose 

25 fibers do not bind to the producer cells normally used for Ad growth. 

As described herein, psuedotyping can be used to identify fibers 
from subgroup D adenoviruses that confer enhanced infectivity of 
dendritic cells. Fiberless adenovirus vectors can be pseudotyped with 
fiber proteins from different serotypes to generate adenovirus particles 
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with heterologous fiber proteins. Pseudotyping can be accomplished, for 
example, by expression in cells that contain expression plasmids encoding 
the fibers for pseudotyping. These vectors and plasmids can be 
generated as described herein or by any method known to those of skill in 
5 the art. 

Accordingly, provided herein are modified fibers for targeting and 
detargeting and methods of making such fiber proteins and adenoviruses 
containing the fiber proteins. Among the cell types provided herein for 
adenovirus targeting are dendritic cells. 

10 C. Dendritic cell targeting 

Denditric cells have numerous physiological features that render 
them desirable targets for immunotherapeutic approaches. Dendritic cells 
pick up antigens and migrate from the tissues of the body to the lymphoid 
tissues. There these cells present the antigens in lymphoid organs by 

15 displayimg a foreign epitope bound to an MHC protein and trigger humoral 
and cellular immune responses. Dentritic cells have the ability to 
distinguish different types of pathogens, such as viruses, bacteria, fungi, 
and switch on specifically targeted immune-response genes against them. 
They are antigen-presenting cells that stimulate T lymphocytes into 

20 attacking infection. Hence delivery of heterologous antigens for 

presentation by dendritic cells provides a means for triggering humoral 
and cellular immune responses against such antigens. Also as noted, 
expression of particular products in dendritic cells also can function to 
inhibit or decrease in inappropriate or undesirable immune response, such 

25 occurs in allergies, autoimmune diseases and inflammatory responses. 
1 . Dendritic Cells 

Dendritic cells (abbreviated DCs), which have a variety of important 
physiological features in the immune system, can serve as targets for 
immunotherapy and vaccine development. Dendritic cells play an 



WO 2004/099422 



PCT/US2004/009219 



-37- 

important role in establishing an immune response. Dendritic cells are 
found in T-cell rich areas of the lymphoid tissues where they present 
antigen to T cells to stimulate the adaptive immune response (Janeway 
and Travers (1997) Immunobiology: the immune system in health and 
5 disease, third edition, Current Biology Ltd., New York, N.Y.). 

As an antigen presenting cell, the role of the dendritic cell is to 
capture foreign and self antigens, process them into peptides, and present 
the peptides in the context of MHC (major histocompatibility complex) 
proteins to T lymphocytes. Dendritic cells are highly specialized and 

10 efficient APCs and they control the magnitude, quantity, and memory of 
the adaptive immune responses that they trigger (Steinman and Pope 
(2002) J. Clin. Invest. 705:1519-1526). The T cells activated by 
dendritic cells presenting antigen include, T-helper CD4+ cells, 
particularly cells designated Th1, and CD8+ cytotoxic T lymphocytes 

15 (CTLs). Activated Th1 cells produce IFN-^and induce proliferation and 
antibody production of antigen-specific B lymphocytes. CTLs activated 
by dendritic cells kill cells displaying antigen (such as virus-infected cells) 
by releasing cytotoxic granules into the cell (see, e.g., Steinman and Pope 
(2002) J. Clin. Invest. 705:1519-1526). 

20 As noted, dendritic cells express high levels of MHC molecules for 

antigen presentation rendering them highly efficient APCs. In addition 
they also express a high level of co-stimulatory molecules, which are 
important for enhancing an immune response. Dendritic cells also 
produce a wide array of immunostimulatory cytokines (Scanlan and Jager 

25 (2001) Breast Cancer Res. 3:95-98) and there potentiate and participate 
in immune responses, and have been used as targets for vaccine 
development. Engineering of dentritic cells to express a tumor antigen 
has been pursued as an approach to tumor immunotherapy. For example, 
genetically modified dendritic cells that express particular antigens, such 
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as tumor antigens, can be used as vaccines. In addition, genetic therapy 
that targets such cells in vivo can be used to generate APCs in vivo in 
immunotherapeutic methods. 

Dendritic cells also are capable of diminishing an immune response. 
5 Dendritic cells can be exploited to aid in vaccination against 

autoimmunity, allergy and transplantation rejection, all of which result 
from an uncontrolled or unchecked immune response (Hawiger et at. 
(2001) J. Exp. Med. 194:769-779; Steinman et al. (2003) Annual Rev. 
Immunol. 27:685-71 1). For example, dendritic cells appear to be 

10 important for peripheral T cell tolerance (see, e.g., Steinman et al. (2000) 
J. Exp. Med. 757:41 1-416). Tolerance, or unresponsiveness to an 
antigen, is critical for avoidance of autoimmunity. Dendritic cells are 
capable of inducing significant antigen-specific tolerance in peripheral 
lymphoid tissues (Hawiger et al. (2001) J. Exp. Med. 194:769-779), and 

15 also are capable of inducing tolerance to transplantation antigens (see, Fu 
et al. (1996) Transplantation 52:659-665) and contact allergens (se, 
Steinbrink et al. (1997) J. Immunol. 755:4772-4780). 

Thus, vaccine and immmunotherapeutic strategies involving den- 
dritic cells are important for the treatment of a variety of clinically 

20 important autoimmune and related diseases, including systemic lupus 

erythematosus, myasthenia gravis, rheumatoid arthritis, insulin-dependent 
diabetes mellitus and Graves' disease, as well as for vaccination or 
treatment of cancers and diseases caused by pathogens. 
2. Dendritic Cell Therapies 

25 Different methods for delivery of the antigen gene to dendritic cells 

have been explored, but these generally require ex vivo manipulation of 
cells, including transfection, and then infusion of the cells. This is 
complicated, expensive, and requires generation of patient-specific 
reagents. 
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Adenovirus can be used for dendritic cell therapies. The ability of 
adenovirus serotypes to infect specific cell types, such as dendritic cells, 
can in part be attributed to their interaction, or a lack of interaction, with 
CAR. For example, the requirement for high doses of Ad5 in dendritic cell 
5 (DC) transduction can be explained by the lack of CAR expression on 
dendritic cells (Linette et al. (2000) J. Immunol. 764:3402-3412; Tillman 
et al. (1999) J. Immunol. 762:6378-6383). Several approaches have 
been used to improve DC infection by adenoviruses. A bispecific 
antibody (Ab) which bound to the fiber knob as well as to CD40 (which is 

10 expressed on the surface of DCs) was used to target dendritic cells 

(Tillman et al. (1999) J. Immunol. 762:6378-6383). This study showed 
that the cells expressed sufficient a v integrins for efficient infection by the 
DC-binding adenovirus. This approach requires that production of the 
viral vector and the antibody, and purifcation of the complex, in clinically 

15 acceptable forms. This presents problems with scale-up and 

manufacturing, and negates many of the advantages of adenovirus 
vectors, most notably simple production and purification, as well as 
vector stability. 

Therefore to overcome these limitations, provided herein are 

20 adenoviral vectors that have been modified for efficiently targeting 

dendritic cells. The adenoviral vectors can be used for targeting such 
cells in vivo and ex vivo for immunotherapy and in vitro for studying 
dendritic cell function. 

3, Targeting Adenoviral Particles to Dendritic Cells 

25 Numerous studies have shown that adenovirus (Ad)-mediated 

delivery to dendritic cells can lead to anti-tumor response, but the Ad 
vectors generally used in gene therapy are based on a serotype (Ad5) that 
infects dendritic cells very inefficiently. After in vivo Ad administration, 
infection of a fairly small number of dendritic cells has been directly 
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demonstrated (Zhang et al. (2001) Mol. Therapy 5:697-707; Oberholzer 
etal. (2002) J. Immunol. 168:341 2-341 8; Jooss et al. (1998) J. WroA 
72:4212-4223) and appears to be largely responsible for the cellular 
immune response observed (Zhang etal. (2001) Mol. Therapy 5:697-707; 
5 Jooss etal. (1998) J. Virol. 72:4212-4223). Since Ad5 infects dendritic 
cells poorly (Dietz etal. (1998) Blood 57:393-398; Wan et al. (1997) 
Human Gene Ther. 5:1355-1363; JonuleitefaA (2000) Gene Therapy 
7:249-254; Linette et al. (2000) J. Immunol. 764:3402-3412; Tillman et 
al. (1999) J. Immunol. 762:6378-6383) high multiplicities of infection are 
10 required. 

Most testing has been done using primary cultures of dendritic cells 
derived from peripheral blood or bone marrow by incubation with 
cytokines, usually GM-CSF and IL-4 (Inaba et al. (1998) Isolation of 
dendritic cells In Current Protocols in Immunology, John Wiley & Sons, 

15 Inc. Philadelphia, 3.7.1-3.7.15). Ex vivo infection of dendritic cells, 

followed by re-infusion, has been found to generate effective anti-tumor 
responses (Wan et al. (1997) Human Gene Ther. 5:1355-1363; Linette et 
al. (2000) J. Immunol. 754:3402-3412; Inoue et al. (1999) Immunol. 
Lett. 70:77-81; Sonderbye etal. (1998) Exp. Clin. Immunogenet. 75:100- 

20 111; Ranieri etal. (1999) J. Virol. 75:10416-10425; Ribas etal. (1997) 
Cancer Res. 57:2865-2869; Miller et al. (2000) Human Gene Ther. 
7 7:53-65). Ex vivo infection is not the ideal means for vaccination and 
immunotherapy. 

In addition, recombinant adenoviruses with fiber proteins from the 
25 subgroup B viruses Ad16 and Ad35 have been found to have an 

increased ability to infect human dendritic cells (Havenga et al. (2002) J. 
Virol. 75:4612-4620; Rea et al. (2001) J. Immunol. 755:5236-5244). 
Subgroup B viruses, however, appear to have a broad tropism. For 
example, they transduce a wide variety of cultured cell lines as well as 
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primary cells from a number of different tissue types (Havenga et a/. 
(2002) J. Virol. 75:4612-4620), evidencing such broad tropism. This 
apparent lack of cell-specificity (broad tropism) demonstrated by subgroup 
B indicates that pseudotyping Ad5 or Ad2 virsues with Ad subgroup B 
5 fibers is not advantageous. 

a. Fiber Substitution 
It is shown herein that, contrary to reports in the literature, 
subgroup D viruses can target dendritic cells. Subgroup D viruses exhibit 
a narrower tropism than subgroup B viruses. It is shown herein that 

10 fibers from certain non CAR-using Ad serotypes, particularly Ad subgroup 
D viruses, effectively target receptors on dendritic cells. 

Modified adenvirus particles can be generated by substituting 
dendritic cell-tropic fibers, such as the Subgroup D fibers, or portions 
thereof in place of the Ad subgroup C (or other Ad subgroup, including B 

15 and D, with a heterologous fiber), such as Ad5 or Ad2 fiber to produce 
degarteted (reduced binding to CAR, HSP) and retargeted (to dendritic 
cells) viral particles. 

Any portion of the fiber can replaced with a portion of a subgroup 
D fiber, so long as the portion of the subgroup D fiber confers targeting to 

20 dendritic cells and the fiber assembles into the viral capsid. In one 

embodiment, the entire fiber protein is replaced with a subgroup D fiber. 
In another embodiment, the entire fiber, except for the N-terminus is 
replaced. For example, at least about 1 6 or 17 amino acids or more, up 
to about 60, 70, 80, 90, 100 or more amino acids of N-terminus of the 

25 native fiber is retained to aid in the incorporation of the fiber into the 
native particle. 

Included in the modified adenoviruses provided herein are those 
with fiber protein from subgroup B and D, including, but not limited to 
Ad19p, Ad37, Ad30, Ad8, Ad9, Ad 10, Ad 13, Ad15, Ad17, Ad 19, Ad20, 
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Ad16, Ad35 and adenovirus serotypes 22-30, 32, 33, 36-39, and 42-49, 
expressed on adenoviral particles, particularly subgroup C particles. 
Among the modified capsid proteins areprovided herein are those which 
include fibers containing the sequence of amino acids set forth in any of 
5 SEQ ID NOs. 32, 34, 36, 38 or 40; or a sequence of amino acids having 
60%, 70%, 80%, 90%, 95% or greater sequence identity with a 
sequence of amino acids set forth in any of SEQ ID NOs. 32, 34, 36, 38 
or 40; or a sequence of amino acids encoded by a sequence of 
nucleotides that hybridizes under conditions of high stringency along at 

10 least 70%, at least 80% or at least 90% of its length to a sequence of 
nucleotides that encodes a sequence of amino acids set forth in any of 
SEQ ID NOs. 32, 34, 36, 38 or 40. The fiber proteins can be modified, 
such as described herein, by replacement of the N-terminus to facilitate 
incorporation into the viral particle of a different subgroup, particularly 

15 subgroup C. Such modification is generally inclusion of at least 16 or 17 
amino acids up to about 60 or 61 or more contiguous amino acids from 
the N-terminus of the native fiber such that dendritic cell targeting is 
introduced. 

In one exemplary embodiment, a packaging cell strategy is used to 
20 produce particles of a fiber-deleted Ad5 vector containing fiber proteins 
from Ads of subgroup B (Ad3, Ad16, Ad35), subgroup C (Ad5), and 
subgroup D (Ad19p, Ad30, Ad37). Nucleotide and amino acid sequences 
of Ad fibers are set forth in SEQ ID NOs. 41-44 (exemplary chimeric 
fibers) and 31-40 (exemplary wild type fibers that can be modified by 
25 replacement of the N-terminus). The resulting particles exhibit significant 
differences in dendritic cells tropism as demonstrated by their ability to 
infect primary murine bone marrow-derived DC in vitro. Furthermore, the 
particles pseudotyped with the subgroup D particles efficiently and 
specifically target dendritic cells. As described in the herein (see e.g., the 
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Examples) subgroup B fibers appear to bind to receptors distinct from and 
more ubiquitously expressed than those bound by subgroup D fibers. 

While particles with the Ad5 fiber infect dendritic cells rather 
poorly, vector particles pseudotyped with subgroup D fibers or portions 
5 thereof, such as the Ad19p and Ad37, were particularly effective. Thus, 
adenovirus particles, particularly subgroup C particles, modified to 
express all or a portion of a subgroup D fiber, efficiently target dendritic 
cells and can be used to deliver heterologous nucleic acids to such cells in 
vivo and ex vivo. In addition, such particles have reduced binding to 
10 HSP-expressing cells, such as hepatocytes and to CAR~expressing cells 
compared to unmodified subgroup B viral particles, 
b. Efficient Targeting 
Provided herein are recombinant adenoviruses with a limited 
tropism that target dendritic cells. The recombinant adenoviruses can be 
15 used for gene therapy and/or vaccination approaches. Administration can 
be effected in vivo, such as systemically, or ex vivo by contacting cells 
enriched for or containing dendritic cells. 

The recombinant adenoviruses provided herein have a variety of 
advantageous properties. The particles provided herein more efficiently 
20 infect dendritic cells than Ad5 particles or Ad5 particles that express 
subgroup B fibers, and hence are more immunogenic following direct in 
vivo administration. The vectors provided herein that efficiently target 
dendritic cells permit not only ex vivo delivery, but direct in vivo 
administration, thereby eliminating the need for removal of cells, ex vivo 
25 cell culture, and infusion. 

4. Additional Modifications 

The modified adenoviruses provided herein not only exhibit 
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improved tropism for dendritic cells, but also reduced binding to HSP, 
which is expressed on liver cells. The modified particles can be further 
modified to be detargeted from CAR, HSP, a v integrin, or any other native 
receptors, by any of the capsid mutations described below or well known 
5 to those of skill in the art. 

The vectors provided herein also can be modified by including a 
RGD peptide in the fiber protein. It has been shown (see, e.g., Okada et 
a/. (Cancer Res. 67:7913-7919 (2001)) that incorporation of an RGD 
peptide into the fiber protein increased infection of a murine DC line 

10 approximately two-fold. The cell line/Ad system was then used to 

evaluate anti-tumor responses in a mouse tumor xenograft model. When 
DCs were infected ex vivo using equal particle numbers of the wild type 
or modified vectors and then re-infused into mice, the modified vector 
was able to stimulate a significantly better immune response against the 

15 model antigen. 

The particles provided herein also can be further modified by 
inclusion of heterologous nucleic acid that provides a therapeutic product, 
and formulated for administration as vaccines. The adenovirus particles 
and vectors can deliver heterologous nucleic acids to dendritic cells to 

20 alter dendritic cell antigen presentation, cytockine production and other 
dendritic cell functions. 
D. Adenovirus vector detargeting 

Described below are modifications of the viral capsid that ablate 
the interaction of an adenovirus with its natural receptors. In particular, 

25 fiber modifications that result in ablation of the interaction of an adenvirus 
with HSP are described. These fiber modifications can be combined with 
other capsid protein modifications, such as other fiber modifications 
and/or penton and/or hexon modifications, to fully ablate viral interactions 
with natural receptors, when expressed on a viral particle. The 
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modification should not disrupt trimer formation or transport of fiber into 
the nucleus. The entire fiber of a serotype that binds to HSP can be 
replaced with all or a portion of a fiber that does not bind to HSP. 
Generally in such instances, the N-terminus of the replacing fiber is 
5 modified to resemble or to be identical to the replaced fiber to improve its 
incorporation into the viral particle. The number of amino acids at the N- 
terminus required can be empirically determined, but is typically between 
about 5-20, 10-17, 10-20, 10-50, 10-70, 10-100, amino acids, more 
amino acids can be included if convenient. The precise number also can 

10 be based upon the presence of convenient restriction sites in the 

encoding nucleic acid and other such considerations. Generally at least 
about 5-20, such as 16, 17, or 18, amino acids are required. 

The adenovirus fiber protein is a major determinant of adenovirus 
tropism (Gall et al. (1996) J. Virol. 70:2116-2123; Stevenson et al. 

15 (1995) J. Virol. 55:2850-2857). Dogma in the field has been that 
adenoviral entry occurs via binding to CAR and integrins. This is 
underscored by published data (Einfeld et al. (2001) J. Virology 
75'A 1 284-1 1 291 ). The published are not the predominant ones that act 
in vivo. The dominant entry pathway for hepatocytes in vivo involves a 

20 mechanism mediated by the fiber shaft, such as Ad5 shaft, through 
heparin sulfate proteoglycans binding (see, published U.S. application 
Nos. 2004-0002060 and 2003-0215948). 

Elimination of this binding eliminates entry via HSP binding, such as 
in hepatocytes. Adenoviral fiber shaft modifications that ablate viral 

25 interaction with HSP are described in the Examples below and in 

published U.S. application Nos. 2004-0002060 and 2003-0215948. 
Thus, efficient detargeting of adenovirus in vivo can be achieved with 
appropriately designed fiber proteins. Suitable modifications, such as 
described herein, can be made with respect to any adenovirus in which 
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the wild-type interacts with HSP. The ability of an adenoviral vector to 
interact with HSP is modified by replacing the fiber protein or at least the 
binding portion thereof with a fiber (or corresponding portion thereof) that 
does not bind to HSP thereby reducing or eliminating binding to HSP. 
5 This reduction or elimination of HSP binding can be manifested in vivo as 
reduced or eliminated transduction of liver cells in animals to whom the 
resulting viral particles are administered compared to the unmodifed 
particle. Modifications include insertions, deletions, individual amino acid 
mutations and other mutations that alter the structure of the fiber shaft 

10 such that the HSP binding of the modified fiber protein is ablated when 
compared to the HSP binding of the wild-type fiber protein. 

An adenoviral fiber protein is modified by mutating one or more of 
the amino acids that interact with HSP. For example, the HSP binding 
motif of the modified fiber protein is no longer able to interact with HSP 

15 on the cell surface, thus ablating the viral interaction with HSP. For 

example, the adenoviral fiber is from a subgroup C adenovirus. Binding to 
HSP can be eliminated or reduced by mutating the fiber shaft in order to 
modify the ability of the HSP binding motif, which is, for example, KKTK 
sequence (SEQ ID NO. 45) located between amino acid residues 91 to 94 

20 in the Ad5 fiber (SEQ ID NO. 2), to interact with HSP. The fiber proteins 
are modified by chemical and biological techniques known to those skilled 
in the art, such as site directed mutagenesis of nucleic acid encoding the 
fiber or other techniques as illustrated herein. 

In another aspect of this embodiment, the ability of a fiber to 

25 interact with HSP is modified by replacing the wild-type fiber shaft with a 
fiber shaft, or portion thereof, of an adenovirus that does not interact 
with HSP to produce chimeric fiber proteins. The portion is sufficient to 
reduce or eliminate interaction with HSP. Examples of adenoviruses 
having fiber shafts that do not interact with HSP include (a) adenoviruses 
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of subgroup B, such as, but are not limited to, Ad3, Ad7, Ad1 1 , Ad1 6, 
Ad21, Ad34 and Ad35 which do not have interaction with HSP, (b) 
adenoviruses of subgroup F, such as, but are not limited to, Ad40 and 
Ad41, specifically the short fiber, and (c) adenoviruses of subgroup D, 
5 such as but are not limited to, Ad19p, Ad30, Ad37 and Ad46. 

In another embodiment, adenoviral fiber shaft modifications and/or 
pseudotyped fibers that ablate viral interaction with HSP in combination 
with adenoviral fiber knob modifications that ablate viral interactions with 
CAR are provided. Suitable adenoviral fiber modifications include the 

10 fiber knob modifications described in the Examples below and 

modifications known to those of skill in the art (see published U.S. 
application Nos. 2004-002060 and 2003-0215948; see, also, US. Patent 
Application Serial No. 09/870,203, filed on May 30, 2001, published as 
U.S. Published application No. 20020137213, and International Patent 

15 Application No. PCT/EP01 /06286, filed on June 1, 2001, published as 
WO 01/92299). Modifications of the fiber include mutations of at least 
one amino acid in the CD loop of a wild-type fiber protein of an 
adenovirus from subgroup C (such as, e.g., Ad2 or Ad5), subgroup D 
(such as, e.g., Ad19p, Ad30 or Ad37), subgroup E, or the long wild-type 

20 fiber of an adenovirus from subgroup F, whereby the ability of a fiber 
protein to bind to CAR is reduced or substantially eliminated. The fiber 
proteins with ablated CAR interaction are modified by chemical and 
biological techniques known to those skilled in the art and as described 
herein. 

25 Alternatively, adenoviral fiber modifications are made by replacing 

the wild-type fiber knob with a fiber knob of an adenovirus that does not 
interact with CAR. The fiber protein also will be selected so that it does 
not interact with HSP. Examples of adenoviruses having fiber knobs that 
do not interact with CAR include (a) adenoviruses of subgroup B, e.g., 
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Ad3, Ad7, Ad11, Ad16, Ad21, Ad34, Ad35; and (b) adenoviruses of 
subgroup F, e.g., Ad40 and Ad41, specifically the short fiber. 

In another embodiment, adenoviral fiber shaft modifications and/or 
pseudotyped fibers that ablate viral interaction with HSP in combination 
5 with penton modifications that ablate viral interactions with a v integrins 
are provided. Suitable adenoviral penton modifications include the penton 
modifications, which are well known to those of skill in the art (see, e.g., 
U.S. Patent No. 5,731,190; see, also Einfeld eta/. (2001) J. Virology 
75:1 1284-1 1291; and Bai et al. (1 993) J. Virology 67:5198-5205). 

10 For example, penton interaction with a v integrins can be ablated 

(reduced or eliminated) by substitution of the RGD tripeptide motif, 
required for a v interaction, in penton with a different tripeptide that does 
not interact with an a v integrin. The penton proteins with ablated a v 
integrin interactions are modified by chemical and biological techniques 

15 known to those skilled in the art (see, e.g., described U.S. Patent No. 
6,731,190 and as illustrated herein). 

Also provided are adenoviral fiber shaft modifications or 
pseudotyped fibers that ablate viral interaction with HSP in combination 
with adenoviral fiber knob modifications that ablate viral interactions with 

20 CAR and with penton modifications that ablate viral interactions with or v 
integrins. These modifications are described above and prepared using 
chemical and biological techniques known to those skilled in the art and 
as illustrated herein. 

Preparation of fibers modified to eliminate or reduce HSP 

25 interactions and fibers modified to alter interactions with other receptors 
and cell surface proteins, such as CAR and/or a v integrin, also is 
described in the Examples below. The nucleic acid and/or amino acid 
sequences of exemplary modified fibers, whose construction are 
described below) are set forth as SEQ ID NOs. 3-30 as follows: 
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SEQ ID NOs. 3 and 4 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 5FK01, where 
5F refers to adenovirus 5 fiber, KOI is an exemplary mutation of the CAR 
interaction site described herein; 
5 SEQ ID NOs. 5 and 6 set forth the encoding nucleotide sequence 

and amino acid sequence of the modified ber designated 5FK01RGD, 
which further includes an RGD ligand to demonstrate retargeting; 

SEQ ID NOs. 7 and 8 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 5FK012, where 
10 5F refers to adenovirus 5 fiber, K012 is another exemplary mutation of 
the CAR interaction site described herein; 

SEQ ID NOs. 9 and 10 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 5F S* nuc, 
where 5F refers to adenovirus 5 fiber, S* is an exemplary mutation of the 
15 shaft that alters binding to HSP; 

SEQ ID NOs. 1 1 and 12 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 5F S*RGD nuc, 
which further includes an RGD ligand; 

SEQ ID NOs. 13 and 14 set forth the encoding nucleotide sequence 
20 and amino acid sequence of the modified ber designated 5FK01S*, which 
contain the K01 and S* mutations; 

SEQ ID NOs. 15 and 16 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 5FK01S*RGD, 
which further includes an RGD ligand; 
25 SEQ ID NOs. 17 and 18 set forth the encoding nucleotide sequence 

and amino acid sequence of a Ad35 fiber; 

SEQ ID NOs. 19 and 20 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 35FRGD, which 
is 35F fiber with an RGD ligand; 
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SEQ ID NOs. 21 and 22 set forth the encoding nucleotide sequence 
and amino acid sequence of a Ad41 short fiber; 

SEQ ID NOs. 23 and 24 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 41 sFRGD, 
5 which is 41 F short fiber with an RGD ligand; 

SEQ ID NOs. 25 and 26 set forth the encoding nucleotide sequence 
and amino acid sequence of Ad5 penton; 

SEQ ID NOs. 27 and 28 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 5TS35H, which 
10 is a chimeric fiber in which an Ad5 fiber tail and shaft regions (5TS; 

amino acids 1 to 403) are connected to an Ad35 fiber head region (35H; 
amino acids 137 to 323) to form the 5TS35H chimera; and 

SEQ ID NOs. 29 and 30 set forth the encoding nucleotide sequence 
and amino acid sequence of the modified fiber designated 35TS5H, which 
15 is a chimeric fiber in which an Ad35 fiber tail and shaft regions (35TS; 
amino acids 1 to 136) are connected to an Ad5 fiber head region (5H; 
amino acids 404 to 581) to form the 35TS5H chimera. 

The modified fibers are displayed on virus particles by modifying 
the fiber protein and optionally additional proteins. This can be achieved 
20 by preparing adenoviral vectors that express the modified capsid proteins 
and produce particles with modified fibers, or by packaging adenoviral 
vectors, particularly those that do not encode one or more capsid proteins 
in appropriate packaging lines. Hence, as discussed in detail below, 
adenoviral vectors and viral particles with modified fibers that do not bind 
25 to HSP are provided. 
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Retargeting detargeted fibers 

The viral particles that are detargeted as described above, can be 
retargeted to selected cells and/or tissues by inclusion of an appropriate 
targeting ligand in the capsid. The ligand can be included in any of the 
5 capsid proteins, such as fiber, hexon and penton. Loci for inclusion of 
nucleic acid encoding a targeting ligand is known to those of skill in the 
art for a variety of adenovirus serotypes; if necessary appropriate loci and 
other parameters can be empirically determined. 

The ligand can be produced as a fusion by inclusion of the coding 
10 sequences in the nucleic acid encoding a capsid protein, or chemically 

conjugated, such as via ionic, covalent or other interactions, to the capsid 
or bound to the capsid {e.g., by Ab-ligand fusion, where Ab binds capsid 
protein; or by disulfide bonding or other crosslinking moieties or 
chemistries). 

15 Thus, for example, a modified fiber nucleic acid also can include 

sequences of nucleotides that encode a targeting ligand to produce viral 
particles that include a targeting ligand in the capsid. Targeting ligand 
and methods for including such ligands in viral capids are well known. 
For example, inclusion of targeting ligands in fiber proteins is described in 

20 U.S. Patent Nos. 5,543,328 and 5,756,086 and in U.S. Patent 
Application Serial No. 09/870,203, published as U.S. Published 
application No. 20020137213, and International Patent Application No. 
PCT/EP01/06286, published as WO 01/92299. For different serotypes 
and strains of adenoviruses, loci for insertion of targeting ligands can be 

25 empirically determined. For different serotypes and strains, such loci can 
vary. 

Because the adenovirus fiber has a trimeric structure, the ligand 
can be selected or designed to have a trimeric structure so that up to 
three molecules of the ligand are present for each mature fiber. Such 
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ligands can be incorporated into the fiber protein using methods known in 
the art (see, e.g., U.S. Patent No. 5,756,086). Instead of the fiber, the 
targeting ligand can be included in the penton or hexon proteins. 
Inclusion of targeting ligands in penton (see for example, in U.S. Patent 
5 Nos. 5,731,190 and 5,965,431) and in hexon (see for example, in U.S. 
Patent No. 5,965,541) is known. 

In one exemplary embodiment, the ligand is included in a fiber 
protein, which is a fiber protein mutated as described herein. The 
targeting ligand can be included, for example, within the HI loop of the 

10 fiber protein. Any ligand that can fit in the HI loop and still provide a 

functional virus is contemplated herein. Such ligands can be as long as or 
longer than 80-100 amino acids (see, e.g., Belousova et al. (2002) J. 
Virol. 75:8621-8631). Such ligands are added by techniques known in 
the art (see, e.g., published International Patent Application publication 

15 No. WO 99/39734 and U.S. Patent Application No. 09/482,682). Other 
ligands can be be discovered through techniques known to those skilled 
in the art. Some non-limiting examples of these techniques include phage 
display libraries or by screening other types of libraries. Such ligands 
include any that target dendritic cells. 

20 Targeting ligands include any chemical moiety that preferentially 

directs an adenoviral particle to a desired cell type and/or tissue, such as 
a dendritic cell. The categories of such ligands include, but are not 
limited to, peptides, polypeptides, single chain antibodies, and multimeric 
proteins. Specific ligands include the TNF superfamily of ligands which 

25 include tumor necrosis factors (or TNF's) such as, for example, TNFa and 
TNF/?, lymphotoxins (LT), such as LT-a and LT-yff, Fas ligand which binds 
to Fas antigen; CD40 ligand, which binds to the CD40 receptor of 
B-lymphocytes; CD30 ligand, which binds to the CD30 receptor of 
neoplastic cells of Hodgkin's lymphoma; CD27 ligand, NGF ligand, and 
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OX-40 Iigand; transferrin, which binds to the transferrin receptor located 
on tumor cells, activated T -cells, and neural tissue cells; ApoB, which 
binds to the LDL receptor of liver cells; alpha-2-macrogIobulin, which 
binds to the LRP receptor of liver cells; alpha-I acid glycoprotein, which 
5 binds to the asialoglycoprotein receptor of liver; mannose-containing 
peptides, which bind to the mannose receptor of macrophages; 
sialyl-Lewis-X antigen-containing peptides, which bind to the ELAM-I 
receptor of activated endothelial cells; CD34 Iigand, which binds to the 
CD34 receptor of hematopoietic progenitor cells; ICAM-I, which binds to 

10 the LFA-I (CD1 1 b/CD1 8) receptor of lymphocytes, or to the Mac-I 
(CD1 1a/CD18) receptor of macrophages; M-CSF, which binds to the 
c-fms receptor of spleen and bone marrow macrophages; circumsporo- 
zoite protein, which binds to hepatic Plasmodium falciparum receptor of 
liver cells; VLA-4, which binds to the VCAM-I receptor of activated 

15 endothelial cells; HIV gp120 and Class II MHC antigen, which bind to the 
CD4 receptor of T -helper cells; the LDL receptor binding region of the 
apolipoprotein E (ApoE) molecule; colony stimulating factor, or CSF, 
which binds to the CSF receptor; insulin-like growth factors, such as IGF-I 
and IGF-II, which bind to the IGF-I and IGF-I I receptors, respectively; 

20 Interleukins 1 through 14, which bind to the Interleukin 1 through 14 
receptors, respectively; the Fv antigen-binding domain of an 
immunoglobulin; gelatinase (MMP) inhibitor; bombesin, gastrin-releasing 
peptide; substance P; somatostatin; luteinizing hormone releasing 
hormone (LHRH); vasoactive peptide (VIP); gastrin; melanocyte 

25 stimulating hormone (MSH); cyclic RGD peptide and any other Iigand or 
cell surface protein-binding (or targeting) molecule. Such ligands can be 
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advantageously employed with the Ad5 particles pseudotyped with 
subgroup D adenovirus fiber, such as, for example, Adl9p, Ad30 or 
Ad37 fiber. 

E. Nucleic acids, adenoviral vectors and cells containing the nucleic 
5 acids and cells containing the vectors 

Also provided are polynucleotides that encode modified, including 

chimeric and/or heterologous, capsid proteins and that encode vectors for 

preparation of adenovirus that express modified capsid proteins provided 

herein. The sequences of the wild-type adenovirus proteins from many 

10 different adenovirus serotypes are well known in the art and are modified 
as described herein or by any suitable method. 

Also provided are vectors including the polynucleotides provided 
herein. Such vectors include partial or complete adenoviral genomes and 
plasmids. Such vectors are constructed by techniques known to those 

15 skilled in the art and as illustrated herein. Also provided are adenoviral 
vectors modified by replacing whole fiber protein, or portions thereof, 
with the fiber proteins, or appropriate portions thereof, from an 
adenovirus of a different serotype that more efficiently targets dendritic 
cells. Adenoviruses that target dendritic cells can be identified by using 

20 the methods described herein. Their fiber-encoding genes can then be 
used to pseudotype viruses, such as Ad5 or Ad2 and infection and gene 
delivery of adenoviruses with the heterologous or chimeric fibers can be 
detected. Among the adenoviral vectors provided herein are those of 
subgroup C, which include Ad2 and Ad5, in which the nucleic acid 

25 encoding the fiber knob and a portion or all of the fiber shaft domain is 

replaced with nucleic acid encoding fiber or an appropriate portion thereof 
from a subgroup D adenovirus, such as Ad19p, Ad30 or Ad37. 

Thus, adenoviral fiber modifications or substitutions can be made in 
viral particles by replacing the entire fiber protein, or a portion thereof, 
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with the fiber protein of an adenovirus that more efficiently binds to 
receptors on dendritic cells. Generally the heterologous adenovirus fiber 
is from a subgroup D adenovirus, such as Adl9p, Ad30 or Ad37. 
Adenoviral vectors of subgroup C, such as Ad2 and Ad5, having a 
5 replaced fiber knob are prepared using techniques well known in the art 
and as illustrated herein. 

In particular, as exemplified herein, the nucleic acid and/or amino 
acid sequences of exemplary heterologous and/or modified fibers for 
dendritic cell targeting are set forth as SEQ ID NOs. 31-44 as follows: 
10 SEQ ID NOs. 31 and 32 set forth the encoding nucleotide sequence 

and amino acid sequence of Ad37 fiber. 

SEQ ID NOs. 33 and 34 set forth the encoding nucleotide sequence 
and amino acid sequence of Ad19p fiber. 

SEQ ID NOs. 35 and 36 set forth the encoding nucleotide sequence 
15 and amino acid sequence of Ad30 fiber. 

SEQ ID NOs. 37 and 38 set forth the encoding nucleotide sequence 
and amino acid sequence of Ad16 fiber. 

SEQ ID NOs. 39 and 40 set forth the encoding nucleotide sequence 
and amino acid sequence of Ad35 fiber. 
20 SEQ ID NOs. 41 and 42 set forth the encoding nucleotide sequence 

and amino acid sequence of Ad5/Ad16 chimeric fiber. The chimeric fiber 
contains the N-terminal 17 amino acids from Ad5 and the remainder of 
the sequence is from Ad 16. 

SEQ ID NOs. 43 and 44 set forth the encoding nucleotide sequence 
25 and amino acid sequence of Ad5/Ad35 chimeric fiber. The chimeric fiber 
contains the N-terminal 17 amino acids from Ad5 and the remainder of 
the sequence is from Ad35. 
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1 . Preparation of viral particles 

The packaging cells used to produce the viruses provided herein 
contain the nucleic acid encoding the capsid (i.e. fiber, penton, hexon) 
protein. Such nucleic acid can be transfected into the cell, generally as 
5 part of a plasmid, or it can be infected into the cell with a viral vector. It 
can be stably incorporated into the genome of the cell, thus providing for 
a stable cell line. Alternatively, nucleic acid encoding the heterologous or 
mutated capsid protein can be removed from the genome, in which case a 
transient complementing cell is employed. 

10 The adenovirus genome to be packaged is transferred into the 

complementing cell by techniques known to those skilled in the art. 
These techniques include transfection or infection with the adenovirus. 
The nucleic acid encoding the mutated or heterologous fiber protein can 
be in this genome instead of in the packaging cell. 

15 In certain cases, when the nucleic acid in the genome to be 

packaged encodes a mutated or heterologous fiber protein, it can be 
desirable for the packaging cell to also encode a fiber protein. Such 
protein can assist in the maturation and packaging of an infectious 
particle. Such protein can be a wild-type fiber protein or one modified 

20 such that it is unable to attach to the penton base protein and is for use, 
for example, in producer cells where the fiber is included to provide the 
packaging function and the vector encodes a full-length fiber. 

The packaging cells are cultured under conditions that permit the 
production of the desired viral particle. The viral particles are recovered 

25 by standard techniques. An exemplary method for producing adenoviral 
particles provided herein is as follows. The nucleic acid encoding the 
mutated or heterologous capsid protein is made using standard techniques 
in an adenoviral shuttle plasmid. This plasmid contains the right end of 
the virus, in particular from the end of the E3 region through the right ITR. 
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This plasmid is co-transfected into competent cells of an E. coli strain, 
such as the well known E. coli strain BJ5183 (see, e.g., Degryse (1996) 
Gene 7 70:45-50) along with a plasmid, which contains the remaining 
portion of the adenovirus genome, except for the E1 region and 
5 sometimes also the E2a region and also contains a corresponding region 
of homology. Homologous recombination between the two plasmids 
generates a full-length plasmid encoding the entire adenoviral vector 
genome. 

This full-length adenoviral vector genome plasmid is then 

10 transfected into a complementing cell line. The transfection can be 
performed in the presence of a reagent that directs adenoviral particle 
entry into producer cells. Such reagents include, but are not limited to, 
polycations and bifunctional reagents, such as those described herein. A 
complementing cell, for example, is a cell of the PER.C6 cell line, which 

15 contains the adenoviral E1 gene (PER.C6 is available, for example, from 
Crucell, The Netherlands; deposited under ECACC accession no. 
96022940; see, also Fallaux et al. (1998) Hum. Gene Ther. 5:1909- 
1907; see, also, U.S. Patent No. 5,994,128) or an AE1-2a cell (see, 
Gorziglia et al. (1996) J. Virology 70:4173-4178; and Von Seggern et al. 

20 (1 998) J. Gen. Virol. 75:1461-1468)). 

AE1-2a cells are derivatives of the A549 lung carcinoma line 
(ATCC # CCL 185) with chromosomal insertions of the plasmids 
pGRE5-2.E1 (also referred to as GRE5-E1 -SV40-Hygro construct and 
listed in SEQ ID NO. 47) and pMNeoE2a-3.1 (also referred to as MMTV- 

25 E2a-SV40-Neo construct and listed in SEQ ID NO. 48), which provide 
complementation of the adenoviral E1 and E2a functions, respectively. 

The 633 cell line (see, von Seggern et al. (2000) J. Virology 
74:354-362), which stably expresses the adenovirus serotype 5 wild-type 
fiber protein, and was derived from the AE1-2a cell line, is another 
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example of complementing cells. When the cell line is 633 cell line, the 
final passage of the adenoviral vector is performed on another 
complementing cell line {e.g., Per.C6), which does not express wild-type 
Ad5 fiber. 

5 The transfected complementing cells are maintained under standard 

cell culture conditions. The adenoviral plasmids recombine to form the 
adenoviral genome that is packaged. The particles are infectious, but 
replication deficient because their genome is missing at least the E1 
genes. When performed in the 633 cells the particles contain wild-type 

10 and mutated or heterologous fiber proteins. They are recovered from the 
crude viral lysate, amplified, and are purified by standard techniques. 

The recovered particles can be used to infect PER.C6 or AE1-2a 
cells. This permits the recovery of particles whose capsids contain only 
the desired mutated fiber. This two-step procedure provides high titer 

15 batches of the adenoviral particles provided herein. The adenoviral 
particles can be replication competent or replication incompetent. 

In one embodiment, the particles selectively replicate in certain 
predetermined target tissue but are replication incompetent in other cells 
and tissues. In a particular embodiment, the adenoviral particles replicate 

20 in abnormally proliferating tissue, such as solid tumors and other 

neoplasms. In replication conditional adenoviruses, a gene essential for 
replication is placed under control of a heterologous promoter which is 
cell or tissue specific. For example, the E1a gene is placed under control 
of a promoter which is active in a tumor cell to produce an oncolytic 

25 adenovirus or oncolytic adenoviral vector. Administration of oncolytic 
adenoviral vectors to tumor cells kills the tumor cells. Such replication 
conditional adenoviral particles and vectors can be produced by 
techniques known to those skilled in the art, such as those disclosed in 
the above-referenced U.S. Patent Nos. 5,998,205 and 5,801,029. These 
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particles and vectors can be produced in adenoviral packaging cells as 
disclosed above. Generally packaging cells are those that have been 
designed to limit homologous recombination that could lead to wild-type 
adenoviral particles. Such cells are well known and include the packaging 
5 cell known as PER.C6 (see, e.g., U.S. Patent Nos. 5,994,128 and 
6,033,908; deposited under ECACC accession no. 96022940). 
2. Adenoviral vectors and particles 

The adenovirus as used herein for production of the adenoviral 
vectors and particles can be of any serotype, such as an Ad5 or Ad2. 

10 Adenoviral stocks that can be employed as a source of adenovirus or 
adenoviral coat protein, such as fiber and/or penton base, can be 
amplified from the adenoviral serotypes 1 through 51, which are currently 
available from the American Type Culture Collection (ATCC, Rockville, 
Md.), or from any other serotype of adenovirus available from any other 

15 source. For instance, an adenovirus can be of subgroup A (e.g., 

serotypes 12, 18, 31), subgroup B (e.g., serotypes 3, 7, 1 1 , 1 4, 1 6, 21 , 
34, 35, 50), subgroup C (e.g., serotypes 1, 2, 5, 6), subgroup D (e.g., 
serotypes 8, 9, 1 0, 1 3, 1 5, 1 7, 1 9, 20, 22-30, 32, 33, 36-39, 42-49, 
51), subgroup E (serotype 4), subgroup F (serotype 40, 41), or any other 

20 adenoviral serotype. In certain embodiments, the adenovirus is a 

subgroup C adenovirus. Subgroup C adenoviruses which are modified in 
as described herein, include, but are not limited to, Ad2 and Ad5. 

The adenoviral vectors provided herein can be used to study cell 
transduction and gene expression in vitro or in various animal models. 

25 The latter case includes ex vivo techniques, in which cells are transduced 
in vitro and then administered to the animal. They also can be used to 
conduct gene therapy on humans or other animals. Such gene therapy 
can be ex vivo or in vivo. For in vivo gene therapy, the adenoviral 
particles in a pharmaceutically-acceptable carrier are delivered to a human 
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in a therapeutically effective amount in order to prevent, treat, or 
ameliorate a disease or other medical condition in the human through the 
introduction of a heterologous gene that encodes a therapeutic protein 
into cells in such human. The adenoviruses are delivered at a dose 
5 ranging from approximately 1 particle per kilogram of body weight to 

approximately 10 14 particles per kilogram of body weight. Generally, they 
are delivered at a dose of approximately 10 6 particles per kilogram of 
body weight to approximately 10 13 particles per kilogram of body weight, 
and typically the dose ranges from approximately 10 8 particles per 
10 kilogram of body weight to approximately 10 12 particles per kilogram of 
body weight. 

Any vectors known to those of skill in the art can be employed and 
used to produce viral particles that include fibers modified to enhance 
binding and infectivity of dendritic cells. 

15 a. Gutless vectors 

Gutted adenovirus vectors are those from which most or all viral 
genes have been deleted. They are grown by co-infection of the 
producing cells with a "helper" virus (such as using an El-deleted Ad 
vector), where the packaging cells express the E1 gene products. The 

20 helper virus trans-complements the missing Ad functions, including 

production of the viral structural proteins needed for particle assembly. 
To incorporate the capsid modifications into a gutted adenoviral vector 
capsid, the changes must be made to the helper virus as described herein. 
All the necessary Ad proteins including the modified capsid protein are 

25 provided by the modified helper virus, and the gutted adenovirus particles 
are equipped with the particular modified capsid expressed by the host 
cells. The E1a, Eb, E2a, E2b and E4 are generally required for viral 
replication and packaging. If these genes are deleted, then the packaging 
cell must provide these genes or functional equivalents. 
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A helper adenovirus vector genome and a gutless adenoviral vector 
genome are delivered to packaging cells. The cells are maintained under 
standard cell maintenance or growth conditions, whereby the helper 
vector genome and the packaging cell together provide the 
5 complementing proteins for the packaging of the adenoviral vector 
particle. Such gutless adenoviral vector particles are recovered by 
standard techniques. The helper vector genome can be delivered in the 
form of a plasmid or similar construct by standard transfection 
techniques, or it can be delivered through infection by a viral particle 

10 containing the genome. Such viral particle is commonly called a helper 
virus. Similarly, the gutless adenoviral vector genome can be delivered to 
the cell by transfection or viral infection. 

The helper virus genome can be the modified adenovirus vector 
genome as disclosed herein. Such genome also can be prepared or 

15 designed so that it lacks the genes encoding the adenovirus E1A and E1B 
proteins. In addition, the genome can further lack the adenovirus genes 
encoding the adenovirus E3 proteins. Alternatively, the genes encoding 
such proteins can be present but mutated so that they do not encode 
functional E1A, E1B and E3 proteins. Furthermore, such vector genome 

20 can not encode other functional early proteins, such as E2A, E2B3, and 
E4 proteins. Alternatively, the genes encoding such other early proteins 
can be present but mutated so that they do not encode functional 
proteins. 

In producing the gutless vectors, the helper virus genome also is 
25 packaged, thereby producing helper virus. In order to minimize the 

amount of helper virus produced and maximize the amount of gutless 
vector particles produced, the packaging sequence in the helper virus 
genome can be deleted or otherwise modified so that packaging of the 
helper virus genome is prevented or limited. Since the gutless vector 



WO 2004/099422 



PCT/US2004/009219 



-62- 

genome will have an unmodified packaging sequence, it will be 
preferentially packaged. 

One way to do this is to mutate the packaging sequence by 
deleting one or more of the nucleotides comprising the sequence or 
5 otherwise mutating the sequence to inactivate or hamper the packaging 
function. One exemplary approach is to engineer the helper genome so 
that recombinase target sites flank the packaging sequence and to 
provide a recombinase in the packaging cell. The action of recombinase 
on such sites results in the removal of the packaging sequence from the 

10 helper virus genome. The recombinase can be provided by a nucleotide 
sequence in the packaging cell that encodes the recombinase. Such 
sequence can be stably integrated into the genome of the packaging cell. 
Various kinds of recombinase are known by those skilled in the art, and 
include, but are not limited to, Cre recombinase, which operates on 

15 so-called lox sites, which are engineered on either side of the packaging 
sequence as discussed above (see, e.g., U.S. Patent Nos. 5,919, 676, 
6,080,569 and 5,919,676; see, also, e.g., Morsy and Caskey, 
Molecular Medicine Today, Jan. 1999, pgs. 18-24). 

An example of a gutless vector is pAdARSVDys (Haecker et al. 

20 (1996) Hum Gene Then 7:1907-1914)). This plasmid contains a 
full-length human dystrophin cDNA driven by the RSV promoter and 
flanked by Ad inverted terminal repeats and packaging signals. 293 cells 
are infected with a first-generation Ad, which serves as a helper virus, 
and then transfected with purified pAdARSVDys DNA. The helper Ad 

25 genome and the pAdARSVDys DNA are replicated as Ad chromosomes, 
and packaged into particles using the viral proteins produced by the 
helper virus. Particles are isolated and the pAdARSVDys-containing 
particles separated from the helper by virtue of their smaller genome size 
and therefore different density on CsCI gradients. Other examples of 
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gutless adenoviral vectors are known (see, e.g., Sandig et al. (2000) 
Proc. Natl. Acad. Sci. U.S.A. 97(3)'AQ02-7). 
b. Oncolytic vectors 
Oncolytic adenoviruses are viruses that replicate selectively in 
5 tumor cells. Such vectors generally will not be useful for targeting 

dendritic cells, unless such cells are malignant. Briefly, oncolytic vectors 
are designed to amplify the input virus dose due to viral replication in the 
tumor, leading to spread of the virus throughout the tumor mass. In situ 
replication of adenoviruses leads to cell lysis. This in situ replication 

10 permits relatively low, non-toxic doses to be highly effective in the 

selective elimination of tumor cells. One approach to achieving selectivity 
is to introduce loss-of-function mutations in viral genes that are essential 
for growth in non-target cells but not in tumor cells (see, e.g., U.S. Patent 
No. 5,801,029). This strategy is exemplified by the use of Addl1520, 

15 which has a deletion in the Elb-55KD gene. In normal cells, the 
adenoviral Elb-55KD protein is needed to bind to p53 to prevent 
apoptosis. In p53-deficient tumor cells, E1b-55K binding to p53 is 
unnecessary. Thus, deletion of E1b-55KD should restrict vector 
replication to p53-deficient tumor cells. 

20 Another approach is to use tumor-selective promoters to control 

the expression of early viral genes required for replication (see, e.g., 
International PCT application Nos. WO 96/17053 and WO 99/25860). 
Thus, in this approach the adenoviruses selectively replicate and lyse 
tumor cells if the gene that is essential for replication is under the control 

25 of a promoter or other transcriptional regulatory element that is 
tumor-selective. 

For example oncolytic adenoviral vectors that contain a cancer 
selective regulatory region operatively linked to an adenoviral gene 
essential for adenoviral replication are known (see, e.g., U.S. Patent No. 
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5,998,205). Adenoviral genes essential for replication include, but are 
not limited to, E1a, E1b, E2a, E2b and E4. For example, an exemplary 
oncolytic adenoviral vector has a cancer selective regulatory region 
operatively linked to the El a gene. In other embodiments, the oncolytic 
5 adenoviral vector has a cancer selective regulatory region of the present 
invention operatively linked to the E1a gene and a second cancer 
selective regulatory region operatively linked to the E4 gene. The vectors 
also can include at least one therapeutic transgene, such as, but not 
limited to, a polynucleotide encoding a cytokine such as GM-CSF that can 

10 stimulate a systemic immune response against tumor cells. 

Other exemplary oncolytic adenoviral vectors include those in 
which expression of an adenoviral gene, which is essential for replication, 
is controlled by E2F-responsive promoters, which are selectively 
transactivated in cancer cells. Thus, vectors that contain an adenoviral 

15 nucleic acid backbone that contain in sequential order: A left ITR, an 
adenoviral packaging signal, a termination signal sequence, an E2F 
responsive promoter which is operably linked to a first gene, such as E1a, 
essential for replication of the recombinant viral vector and a right ITR 
(see, published International PCT application No. WO02/06786, and U.S. 

20 Patent No. 5,998,205). 

In other embodiments, the oncolytic adenoviral vector has a cancer 
selective regulatory region operatively linked to the E1a gene and a 
second cancer selective regulatory region operatively linked to the E4 
gene. The vectors also can carry at least one therapeutic transgene, such 

25 as, but not limited to, a polynucleotide encoding a cytokine such as 

GM-CSF that can stimulate a systemic immune response against tumor 
cells. 
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3. Packaging 

The viral particles provided herein can be made by any method 
known to those of skill in the art. Generally they are prepared by growing 
the adenovirus vector that contains nucleic acid that encodes the 
5 modified or heterologous capsid protein in standard adenovirus packaging 
cells to produce particles that express the modified or heterologous capsid 
proteins. Alternatively, the vectors do not encode fiber proteins. Such 
vectors are packaged in producer cells to produce particles that express 
the modified fiber proteins. 

10 As discussed, recombinant adenoviral vectors generally have at 

least a deletion in the first viral early gene region, referred to as E1, which 
includes the E1a and E1b regions. Deletion of the viral E1 region renders 
the recombinant adenovirus defective for replication and incapable of 
producing infectious viral particles in subsequently infected target cells. 

15 Thus, to enable E1 -deleted adenovirus genome replication and to produce 
virus particles requires a system of complementation which provides the 
missing E1 gene product. E1 complementation is typically provided by a 
cell line expressing E1, such as the human embryonic kidney packaging 
cell line, i.e. an epithelial cell line, called 293. Cell line 293 contains the 

20 El region of adenovirus, which provides E1 gene region products to 
"support" the growth of E1 -deleted virus in the cell line (see, e.g., 
Graham et aL, J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 
that may be usable for production of defective adenovirus having a 
portion of the adenovirus E4 region have been reported (WO 96/22378). 

25 Multiply deficient adenoviral vectors and complementing cell lines have 
also been described (WO 95/34671, U.S. Patent No. 5,994,106). 

For example, copending U.S. application Serial No. 09/482,682 
(also filed as International PCT application No. PCT/EP00/00265, filed 
January 14, 200, published as International PCT application No. 
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WO/0042208) provides packaging cell lines that support viral vectors 
with deletions of major portions of the viral genome, without the need for 
helper viruses and also provides cell lines and helper viruses for use with 
helper-dependent vectors. The packaging cell line has heterologous DNA 
5 stably integrated into the chromosomes of the cellular genome. The 
heterologous DNA sequence encodes one or more adenovirus regulatory 
and/or structural polypeptides that complement the genes deleted or 
mutated in the adenovirus vector genome to be replicated and packaged. 
Packaging cell lines express, for example, one or more adenovirus 

10 structural proteins, polypeptides, or fragments thereof, such as penton 
base, hexon, fiber, polypeptide Ilia, polypeptide V, polypeptide VI, 
polypeptide VII, polypeptide VIII, and biologically active fragments 
thereof. The expression can be constitutive or under the control of a 
regulatable promoter. These cell lines are particularly designed for 

15 expression of recombinant adenoviruses intended for delivery of 

therapeutic products. For use herein, such packaging cell lines can 
express the modified or heterologous capsid proteins, such as the fiber 
proteins whose binding and infection of dendritic cells is enhanced. 

Particular packaging cell lines complement viral vectors having a 

20 deletion or mutation of a DNA sequence encoding an adenovirus 

structural protein, regulatory polypeptides E1A and E1B, and/or one or 
more of the following regulatory proteins or polypeptides: E2A, E2B, E3, 
E4, L4, or fragments thereof. 

The packaging cell lines are produced by introducing each DNA 

25 molecule into the cells and then into the genome via a separate 

complementing plasmid or plurality of DNA molecules encoding the 
complementing proteins can be introduced via a single complementing 
plasmid. Of interest herein, is a variation in which the complementing 
plasmid includes DNA encoding adenovirus fiber protein (or a chimeric or 
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modified variant thereof), from Ad virus of subgroup D, such as Ad19p or 
Ad37. 

For applications, such as therapeutic applications, the delivery 
plasmid further can include a nucleotide sequence encoding a 
5 heterologous polypeptide. Exemplary delivery plasmids include, but are 
not limited to, pDV44, pAE1B£-gal and pAE1sp1B (Microbix Biosystems; 
see also, U.S. Patent No. 6,140,087 and U.S. Patent No. 6,379,943). In 
a similar or analogous manner, therapeutic nucleic acids, such as nucleic 
acids that encode therapeutic genes, can be introduced. 
10 The cell further includes a complementing plasmid encoding a fiber 

or other capsid protein as contemplated herein; the plasmid or portion 
thereof is integrated into a chromosome(s) of the cellular genome of the 
cell. 

Typically, the packaging cell lines will contain nucleic acid encoding 

15 the capsid protein or modified capsid protein stably integrated into a 

chromosome or chromosomes in the cellular genome. The packaging cell 

line can be derived from a procaryotic cell line or from a eukaryotic cell 

line. While various embodiments suggest the use of mammalian cells, 

and more particularly, epithelial cell lines, a variety of other, non-epithelial 

20 cell lines are used in various embodiments. Thus, while various 

embodiments disclose the use of a cell line selected from among the 293, 

A549, W162, HeLa, Vero, 211, and 211 A cell lines, any other cell lines 

suitable for such use are likewise contemplated herein. 

4. Propagation and Scale-up of doubly-ablated adenoviral 
25 vectors 

Since doubly ablated adenoviral vectors containing mutations in the 

fiber and/or penton capsid proteins result in inefficient cell binding and 

entry via the CAR/av integrin entry pathway, scaled up technologies 

improve the growth and propagation of such vectors to produce high 



WO 2004/099422 



PCT/US2004/009219 



-68- 

titers of the adenoviral vectors for clinical use. Thus, also provided is a 
method for scaling up the production of detargeted adenoviral vectors. 
The detargeted adenoviral vectors comprise an adenoviral vector modified 
to ablate the interaction of said vector with at least one host cell receptor 
5 compared with a wild-type adenoviral vector. The detargeted adenoviral 
vectors can comprise an adenoviral vector modified to ablate the 
interaction of said vector with one, two, three or more host cell receptors. 
Thus, the method is suitable for producing the detargeted adenoviral 
vectors disclosed herein. 

10 As noted, growth and propagation of doubly and fully ablated 

adenoviral vectors is enhanced by new scale up technologies. Doubly 
ablated vectors contain mutations in the fiber and penton capsid proteins 
that result in inefficient cell binding and entry via the normal cellular entry 
pathway using CAR and integrins. These vectors are fully detargeted in 

15 vitro and, thus, alternative cellular entry strategies allow for the efficient 
growth and generation of high titer preparations. 

Two strategies have been envisioned to scale up vectors that are 
detargeted via fiber and/or penton modifications. These include: (a) the 
use of pseudoreceptor cell lines engineered to express a surface receptor 

20 that binds a ligand displayed on the vector (see, e.g., International PCT 
application No. WO 98/54346) and (b) complementing cell lines that are 
engineered to express native fiber and that can be engineered to express 
native fiber and penton (see, e.g., International PCT application No. WO 
00/42208). Although these systems have shown promise for scaling up 

25 ablated adenoviral vectors, there is a need to develop a system for the 
simple, efficient production of the fully detargeted adenoviral vector for 
therapeutic uses. 
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Provided herein is a scale-up method for the propagation of 
detargeted adenoviral vectors. The method uses polycations and/or 
bifunctional reagents, which when added to tissue culture medium, bind 
adenoviral particles and direct their entry into the producer cells. 
5 Reagents (also called medium additives) also can be included in the 

tissue culture medium containing producer cells to be infected with the 
detargeted adenoviral vectors. Alternatively the reagents can be pre- 
mixed with the virus, which mixture is then added to the tissue producer 
cells. The reagents can be added to tissue culture medium containing 

10 producer cells, or producer cells can be added to tissue culture medium 
containing the reagents. Any suitable producer cell known to the skilled 
artisan can be used in the present methods. The reagents can be added 
at the same time that the producer cells are infected with detargeted 
adenoviral vectors. Generally the reagents are present in the tissue 

15 culture medium prior to infection by the detargeted adenoviral vectors. 
The medium additives are maintained in the tissue culture medium during 
vector growth, spread and propagation. High titer yields of adenoviral 
vectors are obtained by this method. 

Reagents which are useful in this method are those that are 

20 capable of directing adenoviral particle entry into the producer cells. Such 
reagents include, but are not limited to, polycations and bifunctional 
reagents. Suitable polycations include, but are not limited to, 
polytheylenimine; protamine sulfate; poly-L-lysine hydrobromide; 
poly(dimethyl diallyl ammonium) chloride (Merquat(r)-1 00, Merquat(r)280, 

25 Merquat(r)550); poly-L-arginine hydrochloride; poly-L-histidine; 

poly(4-vinylpyridine), poly(4-vinylpyridine) hydrochloride; po!y(4-vinyl- 
pyridine)cross-linked, methylchloride quaternary salt; poly(4-vinyl- 
pyridine-co-styrene); poly(4-vinylpyridinium poly(hydrogen fluoride)); 
po!y(4-vinylpyridinium-P-toluene sulfonate); poly(4-vinylpyridinium-tribro- 
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mide); poIy(4-vinylpyrrolidone-co-2-dimethy!amino-ethyl methacrylate); 
polyvinylpyrrolidone, cross-linked; poly vinylpyrrolidone, poly(meIamine- 
co-formaldehyde); partially methylated; hexadimethrine bromide; poly(Glu, 
Lys) 1:4 hydrobromide; poly(Lys, Ala) 3:1 hydrobromide; poly(Lys, Ala) 
5 2:1 hydrobromide; poly-L-lysine succinylated; poly(Lys, Ala) 1:1 hydro- 
bromide; and po!y(Lys, Trp) 1 :4 hydrobromide. 

Suitable bifunctional reagents include, but are not limited to, 
antibodies or peptides that bind to the adenoviral capsid and that also 
contain a ligand that allows interaction with specific cell surface receptors 

10 of the producer cells. Examples of bifunctional reagents include: (a) 
anti-fiber antibody ligand fusions, (b) anti-fiber-Fab-FGF conjugate, (c) 
anti-penton-antibody ligand fusions, (d) anti-hexon antibody ligand fusions 
and (e) polylysine-peptide fusions. The ligand is any ligand that will bind 
to any cell surface receptor found on the producer cells. 

15 F. Adenovirus Expression Vector Systems 

The adenovirus vector genome that is encapsulated in the virus 
particle and that expresses exogenous genes in a gene therapy setting is 
provided. The components of an recombinant adenovirus vector genome 
include the ability to express selected adenovirus structural genes, to 

20 express a desired exogenous protein, and to contain sufficient replication 
and packaging signals that the genome is packaged into a gene delivery 
vector particle. An exemplary replication signal is an adenovirus inverted 
terminal repeat containing an adenovirus origin of replication, as is well 
known and described herein. Although adenovirus include many proteins, 

25 not all adenovirus proteins are required for assembly of a recombinant 

adenovirus particle (vector). Thus, deletion of the appropriate genes from 
a recombinant Ad vector permits accommodation of even larger "foreign" 
DNA segments. 
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One recombinant adenovirus vector genome is "helper 
independent" so that genome can replicate and be packaged without the 
help of a second, complementing helper virus. Complementation is 
provided by a packaging cell. Particuarly contemplated are helper 
5 dependent systems. In an exemplary embodiment, the adenovirus vector 
genome does not encode a functional adenovirus fiber protein. A non- 
functional fiber gene refers to a deletion, mutation or other modification to 
the adenovirus fiber gene such that the gene does not express any or 
insufficient adenovirus fiber protein to package a fiber-containing 

10 adenovirus particle without complementation of the fiber gene by a 

complementing plasmid or packaging cell line. Such a genome is referred 
to as a "fiberless" genome, not to be confused with a fiberless particle. 
Alternatively, a fiber protein may be encoded but is insufficiently 
expressed to result in a fiber containing particle. 

15 Thus, contemplated for use are helper-independent fiberless 

recombinant adenovirus vector genomes that include genes that 
(a) epxress all adenovirus structural gene products but express insufficient 
adenovirus fiber protein to package a fiber-containing adenovirus particle 
without complementation of said fiber gene, (b) express an exogenous 

20 protein, and (c) contains an adenovirus packaging signal and inverted 
terminal repeats containing adenovirus origin of replication. 

The adenovirus vector genome is propagated in vitro in the form of 
rDNA plasmids containing the genome, and upon introduction into an 
appropriate host, the viral genetic elements provide for viral genome 

25 replication and packaging rather than plasmid-based propogation. 

Exemplary methods for preparing an Ad-vector genome are described in 
the Examples. 

A vector herein includes a nucleic acid (such as DNA) molecule 
capable of autonomous replication in a cell and to which a DNA segment, 
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e.g. r a gene or polynucleotide, can be operatively linked to bring about 
replication of the attached segment. For purposes herein, one of the 
nucleotide segments to be operatively linked to vector sequences encodes 
at least a portion of a therapeutic nucleic acid molecule. As noted above, 
5 therapeutic nucleic acid molecules include those encoding proteins and 
also those that encode regulatory factors that can lead to expression or 
inhibition or alteration of expression of a gene product in a dendritic cell. 
1 . Nucleic Acid Gene Expression Cassettes 
In various embodiments, a peptide-coding sequence of the 

10 therapeutic gene is inserted into an expression vector and expressed; 

however, it also is feasible to construct an expression vector which also 
includes some non-coding sequences as well. Generally, however, 
non-coding sequences are excluded. Alternatively, a nucleotide sequence 
for a soluble form of a polypeptide may be utilized. Another therapeutic 

15 viral vector includes a nucleotide sequence encoding at least a portion of 
a therapeutic nucleotide sequence operatively linked to the expression 
vector for expression of the coding sequence in the therapeutic nucleotide 
sequence. 

The choice of viral vector into which a therapeutic nucleic acid 
20 molecule is operatively linked depends directly, as is well known in the 
art, on the functional properties desired, e.g., vector replication and 
protein expression, and the host cell to be transformed — these being 
limitations inherent in the art of constructing recombinant DNA molecules. 
Although certain adenovirus serotypes are recited herein in the form of 
25 specific examples, it should be understood that the use of any adenovirus 
serotype, including hybrids and derivatives thereof are contemplated. Of 
particular interest, is the use of fiber that targets the resulting viral 
particle to dendritic cells. 
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2. Promoters 

As noted elsewhere herein, an expression nucleic acid in an Ad- 
derived vector also include a promoter, particularly a tissue or cell specific 
promoter, such as one expressed dendritic cells. Promoters are nucleic 
5 acid fragments that contain a DNA sequence that controls the expression 
of a gene located 3' or downstream of the promoter. The promoter is the 
DNA sequence to which RNA polymerase specifically binds and initiates 
RNA synthesis (transcription) of that gene, typically located 3' of the 
promoter. A promoter also includes DNA sequences which direct the 

10 initiation of transcription, including those to which RNA polymerase 
specifically binds. If more than one nucleic acid sequence encoding a 
particular polypeptide or protein is included in a therapeutic viral vector or 
nucleotide sequence, more than one promoter or enhancer element may 
be included, particularly if that would enhance efficiency of expression. 

15 Regulatable (inducible) as well as constitutive promoters may be used, 
either on separate vectors or on the same vector. For example, some 
useful regulatable promoters are those of the CREB-regulated gene family 
and include inhibin, gonadotropin, cytochrome c, glucagon and other. 
(See, e.g., International PCT application No. WO 96/14061). The 

20 promoter selected can be selected from a dendritic cell-specific gene, 
such as NF/cB. 

A regulatable or inducible promoter is a promoter where the rate of 
RNA polymerase binding and initiation is modulated by external stimuli, 
(see, e.g., U.S. Patent Nos. 5,750,396 and 5,998,205). Such stimuli 
25 include various compounds or compositions, light, heat, stress, chemical 
energy sources, and the like. Inducible, suppressible and repressible 
promoters are considered regulatable promoters. Regulatable promoters 
also can include tissue-specific promoters. Tissue-specific promoters 
direct the expression of the gene to which they are operably linked to a 
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specific cell type. Tissue-specific promoters cause the gene located 3' of 
it to be expressed predominantly, if not exclusively, in the specific cells 
where the promoter expressed its endogenous gene. Typically, it appears 
that if a tissue-specific promoter expresses the gene located 3' of it at 
5 all, then it is expressed appropriately in the correct cell types (see, e.g. , 
Palmiter et al. (1986) Ann. Rev. Genet. 20: 465-499). 
G. Heterologous Polynucleotides and Therapeutic Nucleic Acids 

The packaged adenoviral genome also can contain a heterologous 
polynucleotide that encodes a product of interest, such as a therapeutic 

10 protein. Adenoviral genomes containing heterologous polynucleotides are 
well known (see, e.g., U.S. Patent Nos. 5,998,205, 6,156,497, 
5,935,935, and 5,801,029). These can be used for in vitro and in vivo 
delivery of the products of heterologous polynucleotides or the 
heterologous polynucleotides. 

15 The adenoviral particles provided herein can be used to engineer a 

cell to express a protein that it otherwise does not express or does not 
express in sufficient quantities. This genetic engineering is accomplished 
by infecting the desired cell with an adenoviral particle whose genome 
includes a desired heterologous polynucleotide. The heterologous 

20 polynucleotide is then expressed in the genetically engineered cells. For 
use herein the cell is generally a mammalian cell, and is typically a primate 
cell, including a human cell. The cell can be inside the body of the animal 
(in vivo) or outside the body (in vitro). Heterologous polynucleotides (also 
referred to as heterologous nucleic acid sequences) are included in the 

25 adenoviral genome within the particle and are added to that genome by 
techniques known in the art. Any heterologous polynucleotide of interest 
can be added, such as those disclosed in U.S. Patent No. 5,998,205, 
incorporated herein by reference. 



WO 2004/099422 



PCT/US2004/009219 



-75- 

Polynucleotides that are introduced into an Ad genome or vector can 
be any that encode a protein of interest or that are regulatory sequences. 
In particular, the genomes can include heterologous nucleic acid encoding 
a product for expression in a dendritic cell for presentation or to alter the 
5 activity of the dendritic cell. For purposes herein, proteins include, but 
are not limited to tumor antigens. Tumor antigens included, but are not 
limited to carcinoembryonic antigen, NY-BR1, NY-ESO-1, MAGE-1, 
MAGE-3, BAGE, GAGE, SCP-1, SSX-1, SSX-2, SSX-4, CT-7, Her2/Neu, 
NY-BR-62, NY-BR-85 and tumor protein D52 (Scanlan and Jager (2001) 
10 Breast Cancer Res. 3:95-98; Yu and Restifo (2002) J. Clin. Invest. 



7 70:289-94). The following Table includes an exemplary list of tumor 
antigens and tissues expressing such antigens. 



Antigen 


Tissue 


Oncofetal 




OPA 


Fetal pancreas 


CEA 


Colon, Rectal, Stomach, Lung, Pancreas, Kidney, Bladder, 
Head & Neck, Cervical, endometrial, ovarian, Breast 


POA 


Fetal pancreas 


FAP 


Fetal pancreas 


PA8-15 


Pancreatic cancer 
cell line SUIT-2 


Adult 




CA 50 


Colorectal 
carcinoma cell 
line 


CA 1 9-9 


Colon carcinoma 
cell line SW1 116 


CA 242 


Colorectal 
carcinoma cell 
line COLO 205 


CAR-3 


Epidermoid 
carcinoma cell 
line A 431 
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15 



Antigen 


Tissue 


DU-PAN-2 


Pancreatic 




carcinoma cell 




line HPAF 


Ypan-1 


Pancreatic 




carcinoma cell 




line SW1990 


Span-1 


ii 


BW494 


Pancreatic tumor 




tissue 


MUSE 11 


Gastric cancer 




ascites fluid 


■ 


Embryonal carcinoma cells 


Le a 


Colon adenocarcinoma 


Fuc-L A1 


Pancreatic adenocarcinoma 


Le b 


Colon 




adenocarcinoma 




Pancreatic 




adenocarcinoma 


3-isoLun 


Small cell lung 




carcinoma 




Glioma 




Medulloblastoma 




Teratocarcinoma 




ceils 


3',6'hsoLdt 


Liver metastasis 




of colon cancer 




Embryonal 




parrinnma ppII*? 


Fuc-3'- 


Gastrointestinal 


isoL M1 


cancer 


Siaiylated 




Le a 




Fuc-3',6'- 


Human colon 


isoL D1 


adenocarcinoma 


Disialylated 




Le a 




nL A1 


Colon cancer 


i-Antigen 


Lung cancer 
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Antigen 


Tissue 




SSEA-1 
Le x 

Fuc-nL A1 


Teratocarcinoma 
Colon cancer 




Dimeric Le x 


Adenocarcinoma 
Colon cancer 
Liver cancer 


5 


Le v 


Gastric cancer 
Breast cancer 
Colon cancer 




6'-L M1 


Colorectal 

carcinoma 
Lung carcinomas 
Primary 

hepatoma 


1 u 


Sialylated 
Le x 
or 

Fuc-3 -L M1 


Gastrointestinal 
cancer 

Lung carcinoma 






Gastric 
colon 
lung 
breast 

renal cancers 




GB3 
Globo-H 


Burkitt's lymphoma 
breast cancer 




Sulfatide 


Mucinous cystadenocarcinoma, 


15 


Disulfated G A1 


Hepatocellular carcinoma 




N-Glycolylneuraminic 
acid 


Colon cancer 




N-GlycoIyl-G M2 


N-Glycolyl-G M2 


20 


G M2 

OFA-l-1 
OFA-l-2 


Melanoma 






Glioma 






Germ cell tumors 


25 




Melanoma 






Neuroblastoma 
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Antigen 


Tissue 






Small cell lung carninoma 






Glioma 


5 


G M3 

Ag 

FCM1 

2-39 

IF43 

gp-100 melanoma- 
associated antigen 


Melanoma 


I U 


G D3 


Melanoma 




11 iivyi i 

nJM i 


Melanoma 






Medulloblastoma 






Glioma 






Leukemia 


15 




Meninglioma 




9-0-Acetyl-G D3 


Melanoma 




Fuc-G M1 


Small cell lung carcinoma 




COTA 


Colon, ovarian \ 


20 


SW1038 
CTS 


Colon 
prostate 




MAGE-1 MAGE-2 
MAGE-3 (MZ2-E MZ2- 
Bb) 


Lung 

melanocyte 
breast 




MUC-1 


Breast 
pancreas 


25 


Lewis-Ag (GICA) 


Ovarian 
myelin 




TAG-12 


Breast 
ovarian 




TAG-72 


colon 

ovarian pancrease 


30 


Orfan-specific cancer 
neoantigen (OSN) 


Lung 




GP100 


Melanocyte 
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Antigen 


Tissue 


MART-1 


Melanocyte 


p95/p97 


Melanocyte 


EGF receptor 


Squamous tumors 


CA125 


Ovary 




Breast 


p97 

(melanotransferrin) 


Melanocyte 


22-1-1 


uterus 
cervix 
ovary 


GA733 


gastrointestinal 
carcinoma 


YH206 


adenocarcinomas 


MART-2 


melanocytes 


D/AVJ tZ I 


rneidnooyies 


uMO C 1 -O 


meiaocyie 




osteocarcoma 


Uro 


Breast 




lymphocytes 


Lop4U-OU 

L3p90 


Lung 


i nomsen-rneuenricn 
Pan Tumor Antigen 


pancarcinoma 




|Jal IOI CClo 




ovarian 


EPB-2 


B cell lymphoma 




melanoma 




lymphoma 




medullary thyroid 
carcinoma 




gastrointestinal 
carcinoma 
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Antigen 


Tissue 


NS-ESO-1 


melanoma, breast, bladder, prostate, heptocellular i 
carcinoma 


NY-ESO-1 


melanoma, breast, bladder, prostate, heptocellular 
carcinoma 



Proteins also include, but are not limited to, therapeutic proteins, 
5 such as an immunostimulating protein, such as an interleukin, interferon, 
or colony stimulating factor, such as granulocyte macrophage colony 
stimulating factor (GM-CSF; see, e.g., 5,908,763F. Generally, such 
GM-CSF is a primate GM-CSF, including human GM-CSF. Other immuno- 
stimulatory genes include, but are not limited to, genes that encode 

10 cytokines IL-1, IL-2, IL-4, IL-5, IFN, TNF, IL-12, IL-18, and flt3, proteins 
that stimulate interactions with immune cells (B7, CD28, MHC class I, 
MHC class II, TAPs), tumor-associated antigens (immunogenic sequences 
from MART-1, gp100 (pmel-17), tyrosinase, tyrosinase-related protein 1, 
tyrosinase-related protein 2, melanocyte-stimulating hormone receptor, 

15 MAGE1, MAGE2, MAGE3, MAGE12, BAGE, GAGE, NY-ESO-1, -catenin, 
MUM-1, CDK-4, caspase 8, KIA 0205, HLA-A2R1701, -fetoprotein, 
telomerase catalytic protein, G-250, MUC-1, carcinoembryonic protein, 
p53, Her2/neu, triosephosphate isomerase, CDC-27, LDLR-FUT, 
telomerase reverse transcriptase, and PSMA), cDNAs of antibodies that 

20 block inhibitory signals (CTLA4 blockade), chemokines (MIP1, MIP3, 
CCR7 ligand, and calreticulin), and other proteins. 

Other polynucleotides, including therapeutic nucleic acids, such as 
therapeutic genes, of interest include, but are not limited to, anti-angio- 
genic, and suicide genes. Anti-angiogenic genes include, but are not 

25 limited to, genes that encode METH-1, METH -2, TrpRS fragments, pro- 
liferin-related protein, prolactin fragment, PEDF, vasostatin, various 
fragments of extracellular matrix proteins and growth factor/cytokine 
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inhibitors. Various fragments of extracellular matrix proteins include, but 
are not limited to, angiostatin, endostatin, kininostatin, fibrinogen-E 
fragment, thrombospondin, tumstatin, canstatin, and restin. Growth 
factor/cytokine inhibitors include, but are not limited to, VEGF/VEGFR 
5 antagonist, sFlt-1, sFlk, sNRP1, angiopoietin/tie antagonist, sTie-2, 
chemokines (IP-10, PF-4, Gro-beta, IFN-gamma (Mig), IFN, FGF/FGFR 
antagonist (sFGFR), Ephrin/Eph antagonist (sEphB4 and sephrinB2), 
PDGF, TGF and IGF-1. 

A "suicide gene" encodes a protein that can lead to cell death, as 

10 with expression of diphtheria toxin A, or the expression of the protein can 
render cells selectively sensitive to certain drugs, e.g., expression of the 
Herpes simplex thymidine kinase gene (HSV-TK) renders cells sensitive to 
antiviral compounds, such as acyclovir, gancyclovir and FIAU 
(1 -(2-deoxy-2-fluoro-)ff-D-arabinofuranosil)-5-iodouracil). Other suicide 

15 genes include, but are not limited to, genes that encode carboxypeptidase 
G2 (CPG2), carboxylesterase (CA), cytosine deaminase (CD), cytochrome 
P450 (cyt-450), deoxycytidine kinase (dCK), nitroreductase (NR), purine 
nucleoside phosphorylase (PNP), thymidine phosphorylase (TP), varicella 
zoster virus thymidine kinase (VZV-TK), and xanthine-guanine 

20 phosphoribosyl transferase (XGPRT). Alternatively, a therapeutic nucleic 
acid can exert its effect at the level of RNA, for instance, by encoding an 
antisense message or ribozyme, a protein that affects splicing or 3' 
processing (e.g., polyadenylation), or a protein that affects the level of 
expression of another gene within the cell, e.g. by mediating an altered 

25 rate of mRNA accumulation, an alteration of mRNA transport, and/or a 
change in post-transcriptional regulation. The addition of a therapeutic 
nucleic acid to a virus results in a virus with an additional antitumor 
mechanism of action. Thus, a single entity (i.e., the virus carrying a 
therapeutic transgene) is capable of inducing multiple antitumor 
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mechanisms. Other encoded proteins, include, but are not limited to, 
herpes simplex virus thymidine kinase (HSV-TK), which is useful as a 
safety switch (see, U.S. Patent Application No. 08/974,391, filed 
November 19, 1997, which published as PCT Publication No. 
5 WO/9925860), Nos, FasL, and sFasR (soluble Fas receptor). 

Also contemplated are combinations of two or more transgenes 
with synergistic, complementary and/or nonoverlapping toxicities and 
methods of action. The resulting adenovirus can retain the viral oncolytic 
functions and, for example, additionally are endowed with the ability to 
10 induce immune and anti-angiogenic responses and other responses as 
desired. 

Therapeutic polynucleotides and heterologous polynucleotides also 
include those that exert an effect at the level of RNA or protein. These 
include a factor capable of initiating apoptosis, RNA, such as RNAi and 

15 other double-stranded RNA, antisense and ribozymes, which among other 
capabilities can be directed to mRNAs encoding proteins essential for 
proliferation, such as structural proteins, transcription factors, 
polymerases, genes encoding cytotoxic proteins, genes that encode an 
engineered cytoplasmic variant of a nuclease (e.g. RNase A) or protease 

20 (e.g. trypsin, papain, proteinase K and carboxypeptidase). Other 

polynucleotides include a cell or tissue specific promoters, such as those 
used in oncolytic adenoviruses (see, e.g., U.S. Patent No. 5,998,205). 

The heterologous polynucleotide encoding a polypeptide also can 
contain a promoter operably linked to the coding region. Generally the 

25 promoter is a regulated promoter and transcription factor expression 
system, such as the published tetracycline-regulated systems, or other 
regulatable systems (WO 01/30843), to allow regulated expression of the 
encoded polypeptide. Exemplary of other promoters, are tissue-selective 
promoters, such as those described in U.S. Patent No. 5,998,205. An 
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exemplary regulatable promoter system is the Tet-On (and Tet-Off) 
system currently available from Clontech (Palo Alto 7 CA). This promoter 
system allows the regulated expression of the transgene controlled by 
tetracycline or tetracycline derivatives, such as doxycycline. This system 
5 can be used to control the expression of the encoded polypeptide in the 
viral particles and nucleic acids provided herein. Other regulatable 
promoter systems are known (see, e.g., published U.S. Application No. 
20020168714, entitled "Regulation of Gene Expression Using 
Single-Chain, Monomeric, Ligand Dependent Polypeptide Switches," 

10 which describes gene switches that contain ligand binding domains and 
transcriptional regulating domains, such as those from hormone 
receptors). Other suitable promoters that can be employed include, but 
are not limited to, adenoviral promoters, such as the adenoviral major late 
promoter and/or the E3 promoter; or heterologous promoters, such as the 

15 cytomegalovirus (CMV) promoter; the Rous Sarcoma Virus (RSV) 
promoter; inducible promoters, such as the MMT promoter, the 
metallothionein promoter; heat shock promoters; the albumin promoter; 
and the ApoAl promoter. 

Therapeutic transgenes can be included in the viral constructs and 

20 resulting particles. Among these are those that result in an "armed" 

virus. For example, rather than delete E3 region as in some embodiments 
described herein, all or a part of the E3 region can be preserved or 
re-inserted in an oncolytic adenoviral vector (discussed above). The 
presence of all or a part of the E3 region can decrease the 

25 immunogenicity of the adenoviral vector. It also increases cytopathic 
effect in tumor cells and decreases toxicity to normal cells. Typically 
such vector expresses more than half of the E3 proteins. 

Adenoviruses for therapy, including those for human therapy, are 
known. Such known viruses can be modified as provided herein to 
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increase infection of dendritic cells and/or increasing binding to receptors 
expressed on dendritic cells. The adenoviral vectors that are used to 
produce the viral particles can include other modifications. Modifications 
include modifications to the adenovirus genome that is packaged in the 
5 particle in order to make an adenoviral vector. As discussed above, 
adenovirus vectors and particles with a variety of modifications are 
available. Modifications to adenvoiral vectors include deletions known in 
the art, such as deletions in one or more of the El, E2a, E2b, E3, or E4 
coding regions. These adenoviruses are sometimes referred to as early 

10 generation adenoviruses and include those with deletions of all of the 
coding regions of the adenoviral genome ("gutless" adenoviruses, 
discussed above) and also include replication-conditional adenoviruses, 
which are viruses that replicate in certain types of cells or tissues but not 
in other types as a result of placing adenoviral genes essential for 

15 replication under control of a heterologous promoter (discussed above; 
see also U.S. Patent No. 5,998,205, U.S. Patent No. 5,801,029; U.S. 
patent application 60/348,670 and corresponding published International 
PCT application No. WO 02/06786). These include the cytolytic, 
cytopathic viruses (or vectors), including the oncolytic viruses discussed 

20 above. 

Alternatively, as discussed above, the vector can include a 
mutation or deletion in the E1b gene. Typically such mutation or deletion 
in the Elb gene is such that the E1b-19kD protein becomes 
non-functional. This modification of the E1b region can be combined with 
25 vectors where all or a part of the E3 region is present. 
H. Formulation and administration 

1 . Formulation 

Compositions containing therapeutically effective for concentrations 
of recombinant adenovirus delivery vectors for delivery of therapeutic 
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gene products to target cells and/or tissues (i.e. dendritic cells). Modes 
of administration include, but are not limited to, intramuscular, parenteral, 
local, topical and other routes whereby dendritic cells can be targeted. 
The recombinant viral compositions also can be formulated for in 
5 sustained released formulations, such as adsorbed to biodegradable 

supports, including collagen sponges, or in liposomes. Sustained release 
formulations can be formulated for multiple dosage administration, so that 
during a selected period of time, such as a month or up to about a year, 
several dosages are administered. Thus, for example, liposomes can be 

10 prepared such that a total of about two to up to about five or more times 
the single dosage is administered in one injection. The vectors are 
formulated in pharmaceutical^ acceptable carriers. 

The composition can be provided in a sealed sterile vial containing 
an amount such that upon administration a sufficient amount of viral 

15 particles is delivered where about 50 to 150 //I, containing at least about 
10 7 , or 10 8 plaque forming units (pfu) in such volume are delivered and at 
least 10 9 -10 10 pfu are delivered. 

To prepare compositions the viral particles are dialzyed into a 
suitable carrier or viral particles can be concentration and/or mixed 

20 therewith. The resulting mixture can be a solution, suspension or 

emulsion. In addition, the viral particles may be formulated as the sole 
pharmaceutical^ active ingredient in the composition or may be combined 
with other active agents for the particular disorder treated. 

Exemplary suitable carriers include, but are not limited to, 

25 physiological saline, phosphate buffered saline (PBS), balanced salt 
solution (BSS), lactate Ringers solution, and solutions containing 
thickening and solubilizing agents, such as glucose, polyethylene glycol, 
and polypropylene glycol and mixtures thereof. Liposomal suspensions 
also can be suitable as pharmaceutical^ acceptable carriers. These can 
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be prepared according to methods known to those skilled in the art. 

The compositons can be prepared with carriers that protect them 
against rapid elimination from the body, such as time release formulations 
or coatings. Such carriers include controlled release formulations, such 
5 as, but not limited to, microencapsulated delivery systems, and 

biodegradable, biocompatible polymers, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, polyorthoesters, polylactic acid and 
other types of implants that may be placed directly into the body. The 
compositions also can be administered in pellets, such as Elvax pellets 

10 (ethylene-vinyl acetate copolymer resin). 

Liposomal suspensions, including tissue-targeted liposomes, also 
can be suitable as pharmaceutical^ acceptable carriers. For example, 
liposome formulations may be prepared by methods known to those of 
skill in the art (see, e.g., Kimm eta/. (1983) Bioch. Bioph. Acta 728:339- 

15 398; Assil eta/. (1987) Arch Ophthalmol. 705:400; and U.S. Patent No. 
4,522,81 1). The viral particles can be encapsulated into the aqueous 
phase of liposome systems. 

The active materials also can be mixed with other active materials, 
that do not impair the desired action, or with materials that supplement 

20 the desired action or have other action, including viscoelastic materials, 
such as hyaluronic acid, which is sold under the trademark HEALON, 
which is a solution of a high molecular weight (MW) of about 3 millions 
fraction of sodium hyaluronate (manufactured by Pharmacia, Inc; see, 
e.g. , U.S. Patent Nos. 5,292,362, 5,282,851, 5,273,056, 5,229,127, 

25 4,517,295 and 4,328,803). Additional active agents may be included. 

The compositions can be enclosed in ampules, disposable syringes 
or multiple or single dose vials made of glass, plastic or other suitable 
material. Such enclosed compositions can be provided in kits. In 
particular, kits containing vials, ampules or other container. 
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Finally, the vectors can be packaged as articles of manufacture 
containing packaging material, typically a vial, a pharmaceutically 
acceptable composition containing the viral particles and a label that 
indicates the therapeutic use of the composition. 
5 Also provided are kits for practice of the methods herein. The kits 

contain one or more containers, such as sealed vials, with sufficient 
composition for single dosage administration, other reagents as needed, 
and optionally instructions for use. 

Administration of the composition is typically by intravenous or 
10 intramuscular injection, although other modes of administration can be 
effective. 

2. Administration 

The compositions containing the compounds are generally 
administered systemically. It is further understood that, for any particular 

15 subject, specific dosage regimens should be adjusted over time according 
to the individual need and the professional judgment of the person 
administering or supervising the administration of the recombinant 
viruses, and that the concentration ranges set forth herein are exemplary 
only and are not intended to limit the scope or practice of the methods, 

20 uses, and products provided herein. 

In addition to in vivo administration, the viral particles provided 
herein can be used in methods of ex vivo therapy in which mixtures of 
cells, such as bone marrow cells, that include or contain dendritic cells or 
that are enriched for dendritic cells are contacted with the viral particles 

25 so that dendritic cells are preferentially infected. The resulting cells are 
optionally culture in vitro and are then infused into a recipient subject, 
generally the donor. 
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I. Diseases, Disorders and therapeutic products 

Dendritic cells modified with adenovirual particles provided herein 
express heterologous proteins that can be presented or that can alter 
dendritic cell functioning. As noted, the adenoviral particles can be 
5 administered to a subject or can be contacted ex vivo with dendritic 
cells obtained from a donor and can be infused into a subject patient, 
typically the donor. The viral particles, which express fibers targeted to 
dendritic cells will preferentially infect dendritic cells. Dendritic cells 
modified to express particular antigens act as vaccines by stimulating an 

10 immune response against the presented antigen. These cells can be used 
for treatment or prophylaxis of virtually any bacterial, protozoan, parasitic, 
fungal or other infection. In addition, presentation of a tumor antigen 
renders such cells effective for treatment or prophylaxis of cancers. 

Expression of a product that interferes with dendritic cell function, 

15 such as by blocking expression of genes, including genes encoding NF/cB 
or RelB, prevent the dendritic cells from stimulating T-cells. Such 
particles and the resulting cells can be used to treat diseases such as 
asmtha, allergies, autoimmune diseases, such as juvenille diabetes, 
rheumatoid arthritis, lupus and inflammatory diseases. 

20 Pathogens include, but are not limited to, bacterial, such as £. coli 

and anthrax, viruses, such as vaccinia virus [i.e. small pox, chicken pox), 
herpes viruses, cytomegalovirus (CMV) vectors, papillomavirus, parasites 
and fungi. Selected antigens can be determined empirically by identifying 
those that are effective in generatin an immunoprotective response in a 

25 model system such as a rodent model. 

Treatment with the particles, either in vivo or ex vivo can be 
prophylactic where administration (vaccination) generates immunity or it 
can be for treatment of the disease. 
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J. Examples 

The following examples are included for illustrative purposes only 
and are not intended to limit the scope of the invention. 

EXAMPLE 1 

Construction of Ad5 Vectors Containing the Fiber AB Loop, KOI and 
Penton, PD1 Mutations and Derivatives Thereof 

Three recombinant adenoviral vectors were prepared that contain 

the K01 fiber or PD1 penton base mutations either alone or in 

combination, these vectors are designated Av3nBgFK01 Av1 nBgPDI , and 

Av1 nBgFKOI PD1 . Construction of these vectors is described below and 

a general description of each vector is set forth in Table 1 . 

TABLE 1 
Description Of Detargeted 
Recombinant Adenoviral Vectors Used For Scale-up 

Vector 



Vector 


Description 


Av3nBg 


An E1, E2a, E3-deleted adenoviral vector encoding a nuclear 
localizing /?-galactosidase 


AvInBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing /?-galactosidase 


Av3nBgFK01 


The same as Av3nBg but containing the K01 mutation in the 
fiber gene 


Av1nBgPD1 


The same as AvInBg but containing the PD1 mutation in the 
penton gene 


Av1nBgFK01 PD1 


The same as AvInBg but containing the fiber KOI and penton 
PD1 mutations 



AvInBg 

This is a well-known vector, its sequence is set forth in SEQ ID 
25 NO. 51. 
Av3nBg 

This is a well-known vector, its sequence is set forth in SEQ ID 
NO. 52. 



10 
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AvSnBgFKOI 

Genetic incorporation of the K01 fiber mutation to generate 
Av3nBgFK01 

The adenoviral vector Av3nBgFK01 was generated in an E1-, E2a-, 
5 E3-deleted backbone based on the adenovirus serotype 5 genome. It 
contains a RSV promoted nuclear-localizing /?-galactosidase gene in place 
of the E1 region. In addition, the fiber gene carries the KOI mutation. 
This mutation results in a substitution of fiber amino acids 408 and 409, 
changing them from serine and proline to glutamic acid and alanine, 

10 respectively. 

The vector was constructed as follows. First, the plasmid pSKOI 
(Figure 1) was digested with the restriction enzymes Sphl and Muni. The 
resulting DNA fragments were separated by electrophoresis on an agarose 
gel. The 1601 bp fragment containing all but the 5' end of the fiber gene 

15 was excised from the agarose gel and the DNA was isolated and purified. 
The fragment was then ligated with the 9236 bp fragment of 
p5FloxHRFRGD, which had been digested with Sphl and Muni. The 
resulting plasmid, pSFloxHRFKOI , was digested with Spel and Pad and 
the 6867 bp fragment containing the fiber gene was isolated. The 

20 fragment was ligated with the 24,630 bp Spel-PacI fragment of 

pNDSQ3.1. The resulting plasmid, pNDSQ3.1K01 (Figure 2), was used 
together with pAdmireRSVnBg (Figure 3A) to generate a plasmid which 
encodes the full-length adenoviral vector genome. It, however, was 
necessary to remove the Pad site from pNDSQ3.1K01 (Figure 2) prior to 

25 recombination with pAdmireRSVnBg (Figure 3A) so that the final plasmid 
contains a unique Pad site adjacent to the 5' ITR. The Pad site in 
pNDSQ3.1K01 was removed by digestion with Pad followed by blunting 
with T4 DNA Polymerase and religation. The resulting plasmid was called 
pNDSQ3.1K01(Pac. 
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To generate a full-length plasmid containing the entire adenoviral 
genome, pAdmireRSVnBg (Figure 3A) was digested with Sail and 
co-transfected into competent cells of the E. coli strain BJ5183 along 
with pNDSQ3.1 KOI APac, which had been digested with BstBI. 
5 Homologous recombination between the two plasmids generated a 

full-length plasmid encoding the entire adenoviral vector genome, which 
was called pFLAv3nBgFK01 . 

The plasmid pFLAv3nBgK01 was linearized with Pad and 
transfected into 633 cells. In the fiber complementing 633 cell line, the 

10 resulting viral DNA containing the K01 mutation is capable of being 

packaged into infectious viral particles containing a mixture of wildtype 
fiber and mutant fiber proteins. After five rounds of amplification in 633 
cells, a cytopathic effect was observed. Three more rounds of 
amplification in 633 cells were performed followed by purification of the 

15 virus by standard CsCl centrifugation procedures. This viral preparation 
was used to infect AE1-2a cells, which do not express fiber. The 
resulting virus contained only the mutant fiber protein on its capsid. Virus 
particles were purified by standard CsCl centrifugation procedures. 
AvInBgFKOI 

20 The v ector AvInBgFKOI is made in a similar manner to 

Av3nBgFK01 described above. 
Av1nBgK012 

An additional fiber AB loop mutation (described by Einfeld et al. 
(2001) J. Virology 75'A 1284-1 1291) was incorporated into the genome 
25 of AvInBg. This AB loop mutation is a four amino acid substitution, 
R512S, A515G, E516G, and K517G, and is referred to as K012. The 
K012 mutation was incorporated into the fiber gene by PCR gene overlap 
extension using the plasmid pSQ1 (Figure 3B) as template. The pSQ1 
plasmid contains most of the Ad5 genome, extending from base pair 



WO 2004/099422 



PCT7US2004/009219 



-92- 

3329 through the right ITR, in a pBR322 backbone. First, a segment of 
the Ad5 genome extending from within the E3 region into the fiber gene 
was amplified by PCR using the plasmid pSQ1 as a template with the 
following primers termed 5FF, 5'-GAA CAG GAG GTG AGC TTA GA-3' 
5 SEQ ID NO. 53), and 5FR, 5'-TCC GCC TCC ATT TAG TGA ACA GTT 
AGG AGA TGG AGC TGG TGT G-3' {SEQ ID NO. 54). The primer 5FR 
contains an 18 base 5'-extension that encodes the modified fiber AB loop 
amino acids from 512 through 517. A second PCR using pSQ1 as a 
template amplified the region immediately 3' of the AB loop substitution 
10 and extending past the Muni site located 40 base pairs 3' of the fiber 
gene stop codon. The two primers used for this reaction were 3FF: 
5'-TCA CTA AAT GGA GGC GGA GAT GCT AAA CTC ACT TTG GTC 
TTA AC-3' (SEQ ID NO. 55), and 3FR: 5'-GTG GCA GGT TGA ATA CTA 
GG-3' (SEQ ID NO. 56). The primer 3FR contains an 18 base 
15 5'-extension that encodes the modified fiber AB loop amino acids 512 
through 517. Amplified products of the expected size were obtained and 
used in a second PCR with the end primers 5FF and 3FR to join the 
fragments together. The K012 PCR fragment was digested with Xbal 
and Muni cloned directly into the fiber shuttle plasmid, pFBshuttle(EcoRI) 
20 to generate the plasmid pFBSEK012 which contains the 8.8kB EcoRI 

fragment of pSQ1 . The pFBSEK012 plasmid was digested with Xbal and 
EcoRI and cloned into pSQ1 using a three-way ligation to generate 
pSQ1K012 (Figure 3C). The K012 cDNA was incorporated into the 
genome of AvInBg, an adenovirus vector with E1 and E3 deleted 
25 encoding /?-galactosidase, by homologous recombination between 

Clal-linearized pSQ1K012 and pAdmireRSVnBg digested with Sail and 
Pad to generate Av1nBgK012. The K012 vector was transfected in 633 
cells, scaled-up on non-fiber expressing cells and purified, as described 
above for K01 . 
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AvlnBgPDI 

Genetic incorporation of the PD1 penton mutation to generate 
Av1nBgPD1 

The adenoviral vector Av1nBgPD1 is an E1-, E3-deIeted vector 
5 based on the adenovirus serotype 5 genome. It contains a RSV promoted 
nuclear-localizing ^-galactosidase gene in the E1 region and also contains 
the PD1 mutation in the penton gene. The PD1 mutation results in a 
substitution of amino acids 337 through 344 of the penton protein, 
HAIRGDTF (SEQ ID NO. 49), with amino acids SRGYPYDVPDYAGTS 

10 (SEQ ID NO. 50), thus replacing the RGD tripeptide (see, Einfeld et al. 
(2001) J. Virology 75:11284-11291). The mutation in the penton gene 
was generated in the plasmid pGEMpen5, which contains the Adenovirus 
serotype 5 penton gene. To generate the mutation, four oligonucleotides 
were synthesized. The sequences of the oligonucleotides were as 

15 follows: penton 1 : 5' CGC GGA AGA GAA CTC CAA CGC GGC AGO 

CGC GGC AAT GCA GCC GGT GGA GGA CAT GAA 3' (SEQ ID NO. 57); 
penton 2: 5' TAT CGT TCA TGT CCT CCA CCG GCT GCA TTG CCG 
CGG CTG CCG CGT TGG AGT TCT CTT CC 3' (SEQ ID NO. 58); penton 
3: 5' CGA TAG CCG CGG CTA CCC CTA CGA CGT GCC CGA CTA CGC 

20 GGG CAC CAG CGC CAC ACG GGC TGA GGA GAA GCG CGC 3' (SEQ 
ID NO. 59); penton 4: 5' TCA GCG CGC TTC TCC TCA GCC CGT GTG 
GCG CTG GTG CCC GCG TAG TCG GGC ACG TCG TAG GGG TAG CCG 
CGG C 3' (SEQ ID NO. 60). The complementary oligonucleotides penton 
1 and penton 2 were annealed to each other as were penton 3 and 

25 penton 4. The duplex generated by annealing penton 3 and penton 4 
encoded the substitution of amino acids 337 through 344 described 
above. The duplex generated by annealing penton 1 and penton 2 
possessed a 5 base 5' overhang which was compatible to a 5 base 5' 
overhang on the duplex generated by annealing penton 3 and penton 4. 
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The opposite end of the duplex generated by annealing penton 1 and 
penton 2 contained an Earl compatible overhang. The opposite end of the 
duplex generated by annealing penton 3 and penton 4 contained a BbvCI 
compatible overhang. The two duplexes were ligated to each other and 
5 ligated back into the pGEMpenS backbone as follows. First, pGEMpen5 
was digested with BbvCI and Pstl and the resulting DNA fragments were 
separated by electrophoresis on an agarose gel. The 3360 bp fragment 
was excised from the gel and purified. The plasmid pGEMpen5 was also 
digested with Pstl and Earl and the resulting fragments were separated by 

10 electrophoresis on an agarose gel. The 955 bp fragment was excised 
from the gel and purified. These two fragments from the pGEMpen5 
plasmid were ligated with the two pairs of annealed oligonucleotides to 
generate the plasmid pGEMpen5PD1 . 

The mutated penton gene was transferred from pGEMpen5PD1 to 

15 pSQ1 using a 5-way ligation as follows. First, the region of the penton 
gene containing the PD1 mutation was excised from pGEMpen5PD1 by 
digestion with Pvul and Ascl. The 974 bp fragment containing the PD1 
mutation was purified. Four DNA fragments were prepared from the 
pSQ1 plasmid (Figure 3B) as follows. The plasmid was digested with 

20 Csp45l and Fsel and the 9465 bp fragment was purified. In addition 
pSQ1 was digested with Fsel and Pvul and the 2126 bp fragment was 
purified. The plasmid pSQ1 was digested with Ascl and BamHI and the 
5891 bp fragment was purified. Finally, pSQl was digested with BamHI 
and Csp45l and the 14610 bp fragment was purified. The 5 purified DNA 

25 fragments were ligated to each other to form the plasmid pSQ1PD1 
(Figure 4). 

To generate adenoviral vector, pSQ1PD1 was linearized by 
digestion with Clal and co-transfected into PerC6 cells with 
pAdmireRSVnBg (Figure 3A) which had been digested with Sail and Pad. 
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hexadimethrine bromide was maintained in the medium at 4 jL/g/ml. When 

a cytopathic effect was observed, a crude viral lysate was further 

expanded on PerC6 cells. The virus was purified by standard CsCI 

centrifugation procedures. 

5 AvInBgFKOIPDI 

Genetic incorporation of the fiber KOI or KOI 2 mutation in 
combination with the penton PD1 mutation to generate 
AvInBgFKOIPDI 

The adenoviral vectors AvInBgFKOIPDI and Av1 nBgKOI 2PD1 

10 were generated in an E1-, E3-deleted adenovirus serotype 5 genome. 

Both vectors contains a RSV promoted nuclear-localizing /?-galactosidase 
gene in the E1 region and also contains either the KOI or K012 mutation 
in the fiber gene as well as the PD1 mutation in the penton gene. The 
vectors were constructed as follows. First, the plasmid pSQ1PD1 was 

15 digested with Csp45l and Spel and the 23976 bp fragment containing the 
PD1 mutated penton gene was purified. In addition, the plasmids 
pSQ1K01 or pSQ1K012 (Figure 3B) were digested with Csp45l and Spel 
and the 9090 bp fragment containing the KOI or K012 mutated fiber 
gene were purified. The appropriate purified fragments were ligated to 

20 each other to from the plasmid pSQ1FK01PD1 (Figure 5A) or 

pSQ1K012PDl (Figure 5B) that contains the KOI (or K012) mutated 
fiber gene and the PD1 mutated penton gene. To generate virus, 
pSQ1FK01PD1 or pSQK012PD1 was linearized with Clal and co-trans- 
fected into 633 cells with pAdmireRSVnBg (Figure 3A) which had been 

25 digested with Sail and Pad. After three rounds of amplification in 633 

cells a cytopathic effect was observed and the crude viral lysate was then 
amplified on PerC6 cells. Hexadimethrine bromide was maintained in the 
medium at 4 //g/ml. Each virus was purified by standard CsCI 
centrifugation procedures. 
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EXAMPLE 2 

In Vitro Evaluation of Adenoviral Vectors Containing the KOI and PD1 
Mutations 

Several recombinant adenoviral vectors were used in these studies 
5 to demonstrate the function of the KOI fiber mutation and included 
AvInBg, Av1nBgFK01, Av1nBgPD1, and Av1 nBgFKOI PD1 , described 
above. The transduction efficiencies of adenoviral vectors containing the 
K01 and/or PD1 mutations were evaluated on cells of the alveolar 
epithelial cell line A549. The transduction efficiencies were compared to 
10 that of AvInBg, an adenoviral vector containing wild type fiber and 
penton. 

The day prior to infection, cells were seeded into 24-well plates at 
a density of approximately 1 x 10 5 cells per well. Immediately prior to 
infection, the exact number of cells per well was determined by counting 

15 a representative well of cells. Each of the vectors, AvInBg, 

Av1 nBgFKOI, and Av1 nBgFKOI PD1 were used to transduce A549 cells 
at each of the following particle per cell (PPC) ratios: 100, 500, 1000, 
2500, 5000, 10,000. The cell monolayers were stained with X-gal 24 
hours after infection and the percentage of cells expressing 

20 y&-gaIactosidase was determined by microscopic observation and counting 
of cells. Transductions were done in triplicate and three random fields in 
each well were counted, for a total of nine fields per vector. 

The results at the 500 PPC ratio are shown in Figure 6 and show a 
significantly reduced transduction efficiency on A549 cells using vectors 

25 containing the KOI mutation alone or when combined with PD1 

compared to AvInBg. The vectors containing the PD1 mutation alone 
had no effect on adenoviral transduction of A549 cells in vitro. 
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EXAMPLE 3 

In Vivo Analysis of Adenoviral Vectors Containing the FKOI and PD1 
Mutations 

This Example provides experiments that evaluate the in vivo 
5 biodistribution of adenoviral vectors containing the K01 and PD1 

mutations and their influence on adenoviral-mediated liver transduction. 
The results show that ablating the viral interaction with CAR and/or 
integrins is not sufficient to fully detarget adenoviral vectors from the liver 
in vivo. 

10 A positive control cohort received AvInBg and a negative control 

group received HBSS. Additionally, the Av1 nBgFKOI 2 and 
Av1 nBgFKOI 2PD1 vectors were analyzed in vivo. These vectors each 
contain a fiber protein with the four amino acid substitution in the AB 
loop. Additionally, Av1 nBgFKOI 2PD1 contains a mutation in the penton 

15 base. Both of these mutations were known (see, Einfeld eta/. (2001) J. 
Virology 75\ \ 1284-1 1291), and were alleged to decrease liver 
transduction 10 to 700 fold, respectively. Cohorts of five C57BL/6 mice 
received each vector via tail vein injection at a dose of 1 x 10 13 particles 
per kg. The animals were sacrificed approximately 72 hours after vector 

20 administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, 
and kidney were collected from each animal. The median lobe of the liver 
was placed in neutral buffered formalin to preserve the sample for 
yS-galactosidase immunohistochemistry. In addition, tissue from each 
organ was frozen to preserve it for hexon PGR analysis to determine 

25 vector content. A separate sample of liver from each mouse was frozen 
to preserve it for a chemiluminescent /?-galactosidase activity assay. 

For /?-galactosidase immunohistochemistry slices of liver, 
approximately 2-3 mm thick, were placed in 10% neutral buffered 
formalin. After fixation, these samples were embedded in paraffin, 
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sectioned, and analyzed by immunohistochemistry for /?-galactosidase 
expression. A 1:1200 dilution was used of a rabbit anti-yff-galactosidase 
antibody (ICN Pharmaceuticals, Inc.; Costa Mesa, CA) in conjunction with 
a Vectastain ABC kit (Vector Laboratories, Inc., Burlingame, CA) to 
5 visualize positive cells. 

The chemiluminescent /?-galactosidase activity assay was 
performed using the Galacto-Light PlusTM chemiluminescent assay 
(Tropix, Inc., Foster City, CA) system. Tissue samples were collected in 
lysis matrix tubes containing two ceramic spheres (Bio101, Carlsbad, CA) 

10 and frozen on dry ice. The tissues were thawed and 500 jj\ of lysis buffer 
from the Galacto-Light Plus kit was added to each tube. The tissue was 
homogenized for 30 seconds using a FastPrep System (Bio101, Carlsbad, 
CA). Liver samples were homogenized for an additional 30 seconds. 
/?-galactosidase activity was determined in the liver homogenates 

15 according to the manufacture's protocol. 

For hexon PCR analysis DNA from tissues was isolated using the 
Qiagen Blood and Cell Culture DNA Midi or Mini Kits (Qiagen Inc., 
Chatsworth, CA). Frozen tissues were partially thawed and minced using 
sterile disposable scalpels. Tissues were then lysed by incubation 

20 overnight at 55° C in Qiagen buffer G2 containing 0.2 mg/ml RNaseA and 
0.1 mg/ml protease. Lysates were vortexed briefly and then applied to 
Qiagen-tip 100 or Qiagen-tip 25 columns. Columns were washed and 
DNAs were eluted as described in the manufacturer's instructions. After 
precipitation, DNAs were dissolved in water and the concentrations were 

25 spectrophotometrically determined (A260 and A280) on a DU-600 
(Beckman Coulter, Inc.; Fullerton, CA) or a SPECTRAmax PLUS 
(Molecular Devices, Inc.; Sunnyvale, CA) spectrophotometer. 2.3.2. 
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PCR primers and a Taqman probe specific to adenovirus hexon 
sequences were designed using Primer Express software v. 1.0 (Applied 
Biosystems, Foster City, CA). Primer and probe sequences were: 
Hexon Forward Primer (SEQ ID NO. 61): 
5 5'-CTTCGATGATGCCGCAGTG-3' 
Hexon Reverse Primer (SEQ ID NO. 62): 
5'-GGGCTCAGGTACTCCGAGG-3' 
Hexon Probe (SEQ ID NO. 63): 

5'-FAM-TTACATGCACATCTCGGGCCAGGAC-TAMRA-3' 
10 Amplification was performed in a reaction volume of 50 //I under 

the following conditions: 10 ng (tumor) or 1 /jg (liver and lung) of sample 
DNA, 1X Taqman Universal PGR Master Mix (Applied Biosystems), 600 
nM forward primer, 900 nM reverse primer and 100 nM hexon probe. 
Thermal cycling conditions were: 2 minute incubation at 50° C, 10 
15 minutes at 95° C, followed by 35 cycles of successive incubation at 
95° C for 15 seconds and 60° C for 1 minute. Data was collected and 
analyzed using the 7700 Sequence Detection System software v. 1.6.3 
(Applied Biosystems). Quantification of adenovirus copy number was 
performed using a standard curve that includes dilutions of adenovirus 
20 DNA from 1,500,000 copies to 15 copies in the appropriate background 
of cellular genomic DNA. For analysis of tumor tissues, a standard curve 
in a background of 10 ng human DNA was generated. For analysis of 
mouse liver and lung tissues, a standard curve using the same adenovirus 
DNA dilutions in a background of 1 /jg CD-1 mouse genomic DNA was 
25 generated. Samples were amplified in triplicate, and the average number 
of total copies was normalized to copies per cell based on the input DNA 
weight amount and a genome size of 6 x 1 0 9 bp. 

The results of the /?-galactosidase activity assay and adenoviral 
hexon DNA content for liver transduction by these vectors are shown in 
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Figure 7A and 7B. The vector containing the K01 or K012 mutations 
alone showed, on average, a slight increase in liver transduction 
compared to AvInBg, which is consistent with several previous 
experiments. The vectors containing the PD1 mutation alone or combined 
5 with KOI or K012 showed a slight decrease in liver transduction 
compared to AvInBg, suggesting that integrins are involved to some 
extent in hepatic uptake of the adenoviral vectors. 

The results of the immunohistochemical staining of liver sections 
for yff-galactosidase were consistent with the activity assays (data not 

10 shown) and demonstrate that gene expression was localized specifically 
to hepatocytes. The vectors containing the K01 or K012 mutation alone 
showed a slight increase in liver transduction as revealed by a more 
intense and frequent immunohistochemical-staining pattern. The vectors 
containing the PD1 mutation, either alone or combined with K01 or 

15 KOI 2, showed little difference in transduction compared to AvInBg. 
These results demonstrate that ablating the viral interaction with CAR 
and/or integrins is not sufficient to fully detarget adenoviral vectors from 
the liver in vivo. 

In summary, the fiber AB loop mutation contained in AvInBgFKOI 
20 or AvlnBgKO!2 ablates interaction with human and mouse CAR in vitro 
and diminished transduction in vitro. In vivo, however, fiber AB loop 
mutations behaved unexpectantly, because such mutations were found to 
enhance adenoviral-mediated gene transfer to liver and results in 
increasing vector potency. The penton base, PD1 mutation that ablates 
25 interaction with the second receptor involved in adenoviral internalization 
had no effect in vitro and little to no effect in vivo. These studies 
indicated that other receptors are responsible for adenoviral gene transfer 
to the liver in vivo. 
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EXAMPLE 4 

Description Of Adenoviral Vectors Containing A Fiber With Amino Acid 
Substitutions At The Heparin Sulfate Binding Domain In The Fiber Shaft 

Vectors containing substitutions at all four of the amino acids in the 

5 four amino acid motif in the Ad5 fiber shaft (residues 91 to 94, KKTK; 

SEQ ID NO. 45) were generated in order to ablate the potential interaction 

with HSP. The mutation is termed HSP because it potentially eliminates 

binding to heparan sulfate proteoglycans. Vectors containing the HSP 

mutation alone and combined with the KOI mutation (fiber knob AB loop 

10 mutation that ablates CAR binding), the PD1 mutation (penton mutation 
that eliminates RGD/integrin interaction), and a triple knockout vector 
(HSP, K01, PD1) were generated. 

Generation of the HSP fiber mutation: The HSP mutation was 
incorporated into the fiber gene by using a PCR-based strategy of gene 

15 splicing by overlap extension (PGR SOEing). First, a segment of the Ad5 
genome extending from within the E3 region into the 5' end of the fiber 
gene was amplified by PCR using the plasmid pSQ1 (Figure 3B) as a 
template and two primers termed 5FF and 5HSPR. The DNA sequence of 
5FF is as follows: 5' GAA CAG GAG GTG AGO TTA GA 3' (SEQ ID NO. 

20 53). This sequence corresponds to base pairs 25,199 - 25,218 of pSQ1. 
The DNA sequence of 5HSPR is as follows: 5' GGC TCC GGC TCC GAG 
AGG TGG GCT CAC AGT GGT TAG ATT T 3' (SEQ ID NO. 64). 5HSPR 
is a reverse primer for 5FF and corresponds to a region in the fiber shaft 
adjacent to the KKTK (SEQ ID NO. 45) region. The primer contains a 5' 

25 extension that encodes a GAGA substitution for the native KKTK 

(encoded by SEQ ID NO. 45) amino acid sequence. A second PCR using 
pSQ1 as a template amplified the region immediately 3' of the KKTK (SEQ 
ID NO. 45) site and extending past the Muni site located 40 base pairs 3' 
of the stop codon for the fiber gene. The two primers used for this 
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reaction were 3HSPF and 3FR. The DNA sequence of 3HSPF is as 
follows: 5' GGA GCC GGA GCC TCA AAC ATA AAC CTG GAA AT 3' 
(SEQ ID NO. 16). It contains a 5' extension that is complementary to the 
5' extension of 5HSPR. The DNA sequence of 3FR is as follows: 5' GTG 
5 GCA GGT TGA ATA CTA GG 3' (SEQ ID NO. 56). 

The two PCR products were joined by PGR SOEing using primers 
5FF and 3FR. The resulting PCR product was digested with the 
restriction enzymes Xbal and Muni. The 2355 bp fragment was gel 
purified and ligated with the 6477 bp Xbal to Muni fragment of the 

10 plasmid pFBshuttle(EcoRI) (Figure 8) to generate the plasmid pFBSEHSP. 
The plasmid pFBshuttle(EcoRI) was generated by digesting the plasmid 
pSQ1 with EcoRI, then gel purifying and self-ligating the 8.8 kb fragment 
containing the fiber gene. Next, the fiber gene containing the HSP 
mutation was transferred from pFBSEHSP into pSQ1 using a three-way 

15 ligation. The 16,431 bp EcoRI to Ndel fragment of pSQ1, the 9043 bp 
Ndel to Xbal fragment of pSQ1, and the 7571 bp Xbal to EcoRI fragment 
of pFBSEHSP were isolated and ligated to generate pSQIHSP (Figure 9). 

To generate a recombinant adenoviral vector containing the HSP 
mutation in the fiber gene, pSQIHSP was digested with Clal and 

20 pAdmireRSVnBg (Figure 3A) was digested with Sail and Pad, then the 

two digested plasmids were co-transfected into 633 cells (von Seggern et 
al. (2000) J Virology 74:354-362). Homologous recombination between 
the two plasmids generated a full-length adenoviral genome capable of 
replication in 633 cells, which inducibly express Ad5 E1A and 

25 constitutively express wild-type fiber protein. After propagation on 633 
cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP 
mutation, the viral preparation was used to infect PerC6 cells, which do 
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not express fiber. The resulting virus, termed AvInBgFS*, was purified 
by standard CsCI centrifugation procedures. 

Generation of vector containing the HSP and KOI mutations 
To generate an adenoviral vector containing the HSP and KOI 
5 mutations in fiber, a PCR SOEing strategy identical to the one described 
above was used except that the plasmid pSQIFKOI was used as the 
template. The PCR SOEing product was digested with Xbal and Muni and 
ligated with the 6477 bp Xbal to Muni fragment of pFBshuttle(EcoRI) to 
generate pFBSEHSPKOI . The fiber gene containing the HSP and KOI 

10 mutations was transferred from pFBSEHSPKOI into the pSQ1 backbone 
using a three-way ligation strategy identical to the one described above 
for the HSP mutation alone, to generate the plasmid pSQIHSPKOI 
(Figure 10). Recombinant adenoviral vector containing the HSP and K01 
mutations in the fiber gene was generated by co-transfecting 

15 pSQIHSPKOI digested with Clal and pAdmireRSVnBg digested with Sail 
and Pad into 633 cells. Adenovirus was propagated and purified as 
described above for the vector containing the HSP mutation alone. The 
resulting virus was termed Av1 nBgFKOI S*. 

Generation of vector containing the HSP and PD1 mutations 

20 The following strategy was used to generate a recombinant 

adenoviral vector containing the fiber HSP mutation and the penton PD1 
mutation. The plasmid pSQIPDl (Figure 4) was digested with the 
restriction enzymes Csp45l and Spel and the 23,976 bp fragment was 
isolated and purified. In addition, the plasmid pSQIHSP was also 

25 digested with Csp45l and Spel and the 9090 bp fragment was isolated 
and purified and ligated to the 23,976 bp fragment to generate the 
plasmid pSQ1HSPPD1 (Figure 11), which contains the fiber HSP and 
penton PD1 mutations. An adenoviral vector was generated, propagated, 
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and purified as described above. The resulting virus was termed 
Av1nBgS*PD1. 

Generation of vector containing the HSP, KOI, and PD1 mutations 

To generate an adenoviral vector containing the HSP, K01, and 
5 PD1 mutations the following strategy was used. First, the plasmid 
pSQ1PD1 was digested with Csp45l and Spel and the 23,976 bp 
fragment was isolated and purified. In addition, the plasmid 
pSQIHSPKOI was digested with Csp45l and Spel and the 9090 bp 
fragment was isolated and purified. The two DNA fragments were ligated 
10 to form the plasmid pSQ1 HSPK01 PD1 (Figure 12). Recombinant 

adenoviral vector was generated, propagated, and purified as described 
above. The resulting virus was termed Av1 nBgFKOI S*PD1 . 

EXAMPLE 5 

In Vitro Evaluation Of Adenoviral Vectors Containing The HSP Fiber 
15 Mutation 

The transduction efficiencies of adenoviral vectors containing the 
HSP mutation in the fiber gene, either alone or combined with the K01 
and/or PD1 mutations, were evaluated on A549 and HeLa cells. The 
transduction efficiencies were compared to that of AvInBg, an adenoviral 

20 vector containing wild type fiber and penton. The day prior to infection, 
cells were seeded into 24-welI plates at a density of approximately 1 x 
10 5 cells per well. Immediately prior to infection, the exact number of 
cells per well was determined by counting a representative well of cells. 
Each of the vectors, AvlnBg (see, Stevenson et al. (1997) J. Virol. 

25 77:4782-4790), AvInBgS*, Av1 nBgFKOI S*, Av1nBgS*PD1, and 

Av1 nBgFKOI S*PD1, were used to transduce A549 cells at each of the 
following particle per cell (PPC) ratios: 100, 500, 1000, 2500, 5000, 
10,000. HeLa cells were transduced with each of the above vectors, as 
well as a vector containing the K01 mutation alone (Av1 nBgFKOI) and a 
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vector containing the PD1 mutation alone (Av1nBgPD1) at 2000 PPC. 
The cell monolayers were stained with X-gal 24 hours after infection and 
the percentage of cells expressing /?-galactosidase was determined by 
microscopic observation and counting of cells. Transductions were done 
5 in triplicate and three random fields in each well were counted, for a total 
of nine fields per vector. 

The results (depicted in Figures 13A-13B) showed significantly 
reduced transduction efficiencies on A549 and HeLa cells using vectors 
containing the HSP mutation compared to AvInBg. The vectors 
10 containing the HSP mutations, however, demonstrated a dose response 
on A549 cells, in that increasing PPC ratios yielded increasing 
transduction. 

Competition experiments were done to determine which receptor 
molecular interactions are involved in transduction of A549 cells by the 

15 various vectors. Transductions were performed in the presence or 

absence of various competitors including Ad5 fiber knob, a 50 amino acid 
oligopeptide derived from Adenovirus serotype 2 penton base which 
spans the RGD tripeptide region, or heparin (Invitrogen Life Technologies, 
Gaithersburg, MD). Monolayers of A549 cells were cultured in Richters 

20 medium supplemented with 10% FBS and were transduced with AvlnBg, 
AvInBgS*, Av1 nBgFKOI S*, Av1nBgS*PD1, or Av1 nBgFKOI S*PD1 in 
infection medium (IM, Richters medium plus 2% FBS). Different PPC 
ratios were used for the different vectors to achieve measurable 
transduction levels. The PPC ratios were as follows: AvlnBg: 500 PPC, 

25 AvInBgS*: 10,000 PPC, AvInBgFKOIS*: 20,000 PPC, Av1nBgS*PD1: 
10,000 PPC, and Av1 nBgFKOI S*PD1 : 20,000 PPC. Fiber knob 
competition was performed by pre-incubating cells in IM containing 16 
/yg/ml of fiber knob for 10 minutes at room temperature prior to infection 
with virus. Penton base peptide competition was performed by 
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pre-incubating cells in IM containing 500nM peptide for 10 minutes at 
room temperature prior to infection with virus. Heparin competition was 
performed by pre-incubating each adenoviral vector in IM containing 3 
mg/ml of heparin for 20 minutes at room temperature. In all cases, the 
5 competitor remained in the IM during the 1 hour infection when virus was 
rocked on the cell monolayers at 37° C in 5% C02. After infection, the 
monolayers were washed with PBS, 1 ml of complete medium was added 
per well and the cells were incubated for an additional 24 hours to allow 
for yff-galactosidase expression. The cell monolayers were then fixed and 

10 stained with X-Gal. The percentage of cells transduced was determined 
by light microscopy as described above. Each condition was carried out in 
triplicate and three random fields per well were counted, for a total of 
nine fields per condition. The average percentage of transduction per 
high-power field was determined. 

15 The results of the competition experiment (Figure 1 3C) showed 

that fiber knob inhibited transduction of cells by all vectors except for 
those that contained the KOI mutation. The penton base peptide only 
inhibited transduction by Av1 nBgFKOI S*. Heparin inhibited transduction 
by Av1 nBgFKOI S* and Av1 nBgFKOI S*PD1 , but did not affect trans- 

20 duction by any of the other viruses suggesting the presence of additional 
heparin binding sites on the adenoviral capsid but that the shaft contains 
the predominant site. 

EXAMPLE 6 

In Vivo Analysis Of Adenoviral Vectors Containing The HSP Mutation In 
25 Fiber 

The objective of this study was to evaluate the in vivo 
biodistribution of adenoviral vectors containing the HSP mutation and to 
determine whether this shaft modification influences adenoviral-mediated 
liver transduction. In addition, vectors containing the HSP mutation 
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combined with K01, or PD1 , or a combination of all three mutations were 
evaluated as well as vectors containing the KOI mutation alone and the 
PD1 mutation alone. A positive control cohort received AvInBg and a 
negative control group received HBSS. Cohorts of five C57BL/6 mice 
5 received each vector via tail vein injection at a dose of 1 x 10 13 particles 
per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, 
and kidney were collected from each animal. The median lobe of the liver 
was placed in neutral buffered formalin to preserve the sample for 

10 /?-galactosidase immunohistochemistry. In addition, tissue from each 
organ was frozen to preserve it for hexon real time PGR analysis to 
determine vector content. A separate sample of liver from each mouse 
was frozen to preserve it for a chemiluminescent /?-ga!actosidase activity 
assay. /?-galactosidase immunohistochemistry, hexon real-time PCR and 

15 the chemiluminescent /?-galactosidase activity assay were carried out as 
described in Example 3. 

The results of the /?-galactosidase activity assay (Figure 14A) and 
adenoviral hexon DNA content (Figure 14B) showed a dramatic reduction 
in liver transduction by vectors containing the HSP mutation. The vectors 

20 containing the HSP mutation alone resulted in reducing 

adenoviral-mediated liver gene expression by approximately 20-fold. When 
combined with the K01 mutation (HSP, K01, PD1), yielded approximately 
a 1000-fold reduction in jff-galactosidase activity in the liver compared to 
the control vector AvInBg. The vector containing the KOI mutation 

25 alone showed a slight increase, on average, in liver transduction 
compared to AvInBg, which is consistent with several previous 
experiments. The vectors containing the PD1 mutation alone or combined 
with K01 showed a slight decrease in liver transduction compared to 
AvInBg, although the decrease was not statistically significant. Analysis 
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of hepatic adenoviral hexon DNA content (Figure 14B) confirmed these 
results. 

The results of the immunohistochemical staining of liver sections 
for /?-galactosidase were consistent with the activity assays (data not 
5 shown) and demonstrated that gene expression was localized specifically 
to hepatocytes. Vectors containing the HSP mutation, either alone or in 
combination with KOI and/or PD1, showed a dramatic reduction in 
hepatocyte transduction. The vector containing the KOI mutation alone 
showed a slight increase in liver transduction as revealed by a more 
10 intense and frequent immunohistochemical staining pattern. The vectors 
containing the PD1 mutation, either alone or combined with K01, showed 
little difference in transduction compared to AvInBg. 

EXAMPLE 7 

Description of Adenoviral Vectors Containing the HSP Fiber Shaft 
15 Mutation with and without the KOI Fiber Mutation and with and without 
a cRGD Targeting Ligand in the Fiber Knob HI Loop 

Generation of vector containing the HSP fiber shaft mutation and a cRGD 
ligand in the HI loop: The following strategy was used to generate an 
adenoviral vector containing a fiber with the HSP shaft mutation and a 

20 cRGD ligand in the HI loop. The plasmid pSFIoxHRFRGD was digested 
with the restriction enzymes BstXI and Kpnl and the 1 157 bp fragment 
was isolated and purified. In addition, the fiber shuttle plasmid 
pFBSEHSP, described in Example 1 above, was digested with BstXI and 
Kpnl and the 4549 bp and 3156 bp fragments were isolated and purified. 

25 The three fragments were ligated to generate the plasmid pFBSEHSPRGD, 
which encodes a fiber containing the HSP mutation and cRGD in the HI 
loop. The fiber gene from this plasmid was transferred into the pSQ1 
backbone as follows. The plasmid pFBSEHSPRGD was digested with 
EcoRI and Xbal and the 7601 bp fragment was isolated and purified. The 

30 plasmid pSQ1 (Figure 3B) was digested with the restriction enzymes 



WO 2004/099422 



PCT/US2004/009219 



-109- 

EcoRI, Ndel, and Xbal and the 16,431 bp EcoRI to Ndel fragment and the 
9043 bp Ndel to Xbal fragment were isolated and purified. The three 
DNA fragments were ligated to generate the plasmid pSQIHSPRGD 
(Figure 15A). 

5 To generate a recombinant adenoviral vector containing the HSP 

mutation in the fiber gene along with a cRGD ligand in the HI loop, the 

plasmid pSQIHSPRGD was digested with Clal and co-transfected into 

633 cells with pAdmireRSVnBg which had been digested with Sail and 

Pad. After propagation on 633 cells, the virus capsid contained wildtype 

10 and mutant fiber proteins. To obtain viral particles containing only the 

modified fiber with the HSP mutation and a cRGD ligand, the viral 

preparation was used to infect PerC6 cells, which do not express fiber. 

The resulting virus, termed Av1 nBgS*RGD, was purified by standard CsCI 

centrifugation procedures. 

15 Generation of vector containing the HSP fiber shaft mutation, the KOI 
fiber knob mutation, and a cRGD ligand in the HI loop 

The following strategy was used to generate an adenoviral vector 

containing a fiber with the HSP shaft mutation, the KOI fiber knob 

mutation, and a cRGD ligand in the HI loop. The plasmid p5FloxHRFRGD 

20 was digested with the restriction enzymes BstXI and Kpnl and the 1 157 
bp fragment was isolated and purified. In addition, the fiber shuttle 
plasmid pFBSEHSPKOI, described in Example 1 above, was digested with 
BstXI and Kpnl and the 4549 bp and 3156 bp fragments were isolated 
and purified. The three fragments were ligated to generate the plasmid 

25 pFBSEHSPKOI RGD, which encodes a fiber containing the HSP mutation, 
the K01 mutation, and cRGD in the HI loop. The fiber gene from this 
plasmid was transferred into the pSQ1 backbone as follows. The plasmid 
pFBSEHSPKPOl RGD was digested with EcoRI and Xbal and the 7601 bp 
fragment was isolated and purified. The plasmid pSQ1 (Figure 3B) was 
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digested with the restriction enzymes EcoRI, Ndel, and Xbal and the 
16,431 bp EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal 
fragment were isolated and purified. The three DNA fragments were 
ligated to generate the plasmid pSQ1 HSPK01 RGD (Figure 15B). 
5 To generate a recombinant adenoviral vector containing the HSP 

and K01 mutations in the fiber gene along with a cRGD ligand in the HI 
loop, the plasmid pSQ1 HSPK01 RGD was digested with Clal and 
co-transfected into 633 cells with pAdmireRSVnBg which had been 
digested with Sail and Pad. After propagation on 633 cells, the virus 

10 capsid contained wildtype and mutant fiber proteins. To obtain viral 
particles containing only the modified fiber with the HSP and KOI 
mutations and a cRGD ligand, the viral preparation was used to infect 
PerC6 cells, which do not express fiber. The resulting virus, termed 
Av1 nBgFKOI S*RGD, was purified by standard CsCI centrifugation 

15 procedures. 

EXAMPLE 8 

In Vitro Evaluation of Adenoviral Vectors Containing the HSP Fiber Shaft 
Mutation with or without the Fiber Knob KOI Mutation and with or 
without a cRGD Ligand in the HI Loop 

20 The transduction efficiencies of adenoviral vectors containing the 

HSP fiber shaft mutation with or without the fiber K01 mutation and with 
or without the cRGD ligand in the HI loop were evaluated on A549 cells. 
The transduction efficiencies were compared to that of AvInBg, an 
adenoviral vector containing wild type fiber. The day prior to infection, 

25 cells were seeded into 24-well plates at a density of approximately 1 x 
10 5 cells per well. Immediately prior to infection, the exact number of 
cells per well was determined by counting a representative well of cells. 
Each of the vectors, AvInBg, AvInBgS*, Avl nBgFKOI S*, 
Av1nBgS*RGD, and Avl nBgFKOI S*RGD, were used to transduce A549 

30 cells at a particle to cell ratio of 6250. The cell monolayers were stained 
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with X-gal 24 hours after infection and the percentage of cells expressing 
/?-galactosidase was determined by microscopic observation and counting 
of cells. Transductions were done in triplicate and three random fields in 
each well were counted, for a total of nine fields per vector. The results 
5 (Figure 16) showed that the cRGD ligand dramatically increased the 

transduction efficiencies of vectors containing the HSP mutation alone or 
combined with the K01 mutation. AvlnBgS* yielded approximately 22% 
positive cells, while Av1nBgS*RGD yielded approximately 95% positive 
cells. Similarly, Av1nBgFK01S* yielded only 4% positive cells, while 
10 Av1nBgFK01S*RGD yielded 85% positive cells. Therefore, the vector 
containing the shaft mutation is viable and can be retargeted with the 
addition of a ligand. 

EXAMPLE 9 

Construction Of Ad5 Vectors Containing The Ad35 Fiber And Derivatives 
15 Thereof 

The K01 and HSP mutations in the Ad5 fiber protein (5F), 
described above, were designed to ablate interactions that are responsible 
for the normal tropism of the Ad5 virus. An alternative strategy to 
detarget the virus is to replace the Ad5 fiber with a fiber from another 

20 serotype which does not bind CAR and which does not possess the 

heparin sulfate proteoglycan (HSP) binding domain (KKTK; SEQ ID NO. 
45) within the shaft. The fiber of adenovirus serotype 35 (35F) does not 
bind CAR and does not possess the HSP binding domain in its shaft. 
Replacement of the 5F with the 35F can detarget the liver and provide a 

25 suitable platform for retargeting the vector to the desired tissue. 

Generation of an Ad5 based vector containing the Ad35 fiber: A 
PCR SOEing strategy was used to generate a vector based on the Ad5 
serotype but containing the Ad35 fiber in place of the Ad5 fiber. First, 
PCR was used to amplify a region in the plasmid pSQl between the Xbal 
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site at bp 25,309 and the start of the fiber gene. The primers used for 
this reaction were P-0005/U and P-0006/L. The DNA sequence of 
P-0005/U was as follows: 5' C TCT AGA AAT GGA CGG AAT TAT TAC 
AG 3' (SEQ ID NO. 65). This sequence corresponds to bp 25,308 
5 through 25,334 of pSQ1. The DNA sequence of P-0006/L was as 
follows: 5' TCT TGG TCA TCT GCA ACA ACA TGA AGA TAG TG 3' 
(SEQ ID NO. 66). It contains a 10 base pair 5' extension that is 
complementary to the start of the Ad35 fiber gene, while the remainder of 
the primer anneals to the sequence immediately 5' of the ATG start codon 

10 of the fiber gene in pSQ1 . A PGR product of the expected size, 583 bp, 
was obtained and the DNA was gel purified. A second PCR amplified the 
Ad35 fiber gene using DNA extracted from wildtype Ad35 virus as a 
template. The primers used for this reaction were P-0007/U and 
35FMun. The DNA sequence of P-0007/U was as follows: 5' GT TGT 

15 TGC AG ATG ACC AAG AGA GTC CGG CTC A 3' (SEQ ID NO. 67). It 
contains a 10 base pair 5' extension that is homologous to the 10 bp 
immediately prior to the ATG start codon of the fiber gene in Ad5. The 
remainder of the primer anneals to the start of the Ad35 fiber gene. The 
DNA sequence of 35FMun was as follows: 5' AG CAA TTG AAA AAT 

20 AAA CAC GTT GAA ACA TAA CAC AAA CGA TTC TTT A GTT GTC 
GTC TTC TGT AAT GTA AGA A 3' (SEQ ID NO. 68). It contains a 46 
base pair 5' extension that is complementary to the region of the Ad5 
genome between the end of fiber and the Muni site 40 bp downstream of 
the fiber gene. In addition, the 5' extension encodes the last amino acid 

25 and stop codon of the Ad5 fiber gene. This region was retained in the 
vector because it contains the polyadenylation site for the fiber gene. 
The remainder of the primer anneals to the 3' end of the Ad35 fiber gene, 
up to the next to last amino acid codon. A PCR product of the expected 
size, 1027 bp, was obtained and the DNA was gel purified. The two PCR 
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products were mixed and joined together by PCR SOEing using primers 
P-0005/U and P-0009. The DNA sequence of P-0009 was as follows: 5' 
AG CAA TTG AAA AAT AAA CAC GTT G 3' (SEQ ID NO. 69). It 
corresponds to bp 27,648 through 27,669 of pSQ1 and overlaps the 
5 Muni site in that region. A PCR product of the expected size, 1590 bp, 
was obtained and gel purified. It was cloned into the plasmid 
pCR4blunt-TOPO (Invitrogen Corporation, Carlsbad CA) using the Zero 
Blunt TOPO PCR Cloning Kit from Invitrogen. This intermediate cloning 
step simplified DNA sequencing of the PCR SOEing product. The 

10 resulting plasmid, termed pTOPOAd35F, was digested with Xbal and 

Muni and the 1585 bp digestion product was gel purified and ligated with 
the 6477 bp fragment of pFBshuttle(EcoRI) digested with Xbal and Muni 
to generate the plasmid pFBshuttleAd35F. The Ad35 fiber gene was 
transferred from pFBshuttleAd35F into pSQ1 as follows. The plasmid 

15 pSQ1 was digested with EcoRI and the 24,213 bp fragment was gel 
purified. The plasmid pFBshuttleAd35F was linearized with EcoRI and 
ligated with the 24,213 bp fragment from pSQ1 . Restriction diagnostics 
were performed to screen for constructs containing the Ad35 fiber gene 
inserted into the pSQ1 backbone in the correct orientation. The pSQ1 

20 plasmid containing the Ad35 fiber gene in the proper orientation was 
termed pSQ1 Ad35Fiber (Figure 17A). To generate adenoviral vector 
containing the Ad35 fiber, pSQ1 Ad35Fiber was digested with Clal and 
co-transfected into 633 cells with pAdmireRSVnBg which had been 
digested with Sail and Pad. After propagation on 633 cells, the resulting 

25 virus contained Ad5 fiber and Ad35 fibers on its capsid. The virus was 
amplified on PerC6 cells to generate virus containing only the Ad35 fiber 
on its capsid. The resulting virus preparation was termed Av1nBg35F. 

Construction of adenoviral vectors containing chimeric fibers 
derived from Ad5 and Ad35: Two chimeric fiber constructs were prepared 
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by PCR gene overlap extension using plasmids containing the full length 
Ad5 or Ad35 fiber cDNAs as templates. The Ad5 fiber tail and shaft 
regions (5TS; amino acids 1 to 403) were connected with the Ad35 fiber 
head region (35H; amino acids 137 to 323) to form the 5TS35H chimera, 
5 and the Ad35 fiber tail and shaft regions (35TS; amino acids 1 to 136) 
were connected with the Ad5 fiber head region (5H; amino acids 404 to 
581) to form the 35TS5H chimera. The fusions were made at the 
conserved TLWT sequence at the fiber shaft-head junction. 

For the construction of the 5TS35H chimera, the pFBshuttle(EcoR1 ) 

10 plasmid was used as the template with primers P1 and P2 to generate the 
5' fragment. The 3' fragment was generated using the pFBshuttleAd35 
plasmid as the template with the P3 and P4 primers. The sequence of 
each primer used in the construction of these chimeric fibers is listed in 
Table 2. Amplified PCR products of the expected size were obtained and 

15 were gel purified. A second PCR was carried out with the end primers P1 

and P4 to join the two fragments together. The DNA fragment generated 

in the second PCR was digested with Xba1 and Muni and was cloned 

directly into pFBshuttle(EcoR1 ) to create the fiber shuttle plasmid 

pFBshutt!e5TS35H. 

20 TABLE 2 

Primers Used For The Exchange Of Fiber Shaft Regions Between Ad5 And 

Ad35 Fibers 



Primer 
designation 


Sequence 


SEQ ID 


P1 


5'-GAACAGGAGGTGAGCTTAGA-3' 


70 


P2 


B'-GTTAGGTGGAGGG I I I ATTCCGGTCCAC 
AAAGTTAGCTTATC-3' 


71 


P3 


5'-GATAAGCTAACTTTGTGGACCGGAATAAA 
CCCTCCACCTAAC-3' 


72 


P4 


5'-GTGGCAGGTTGAATACTAGG-3 


73 


P5 


5'-GTTAGGAGATGGAGCTGGTGTAGTCCATA 
AGGTGTTAATAC-3' 


74 
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Primer 
designation 


Sequence 


SEQ ID 


P6 


5'-GTATTAACACCTTATGGACTACACCAGCT 
CCATCTCCTAAC-3' 


75 


P7 


5'-TGCGCAAAAACAATCACCACGACAATCACAAT 
GTACATTGGAAGAAATCATACG-3' 


76 


P8 


5'-ACATTGTGATTGTCGTGGTGATT 
GTTTTTGCGCATATGCCATACAATTTGAATG-3' 


77 



5 For the construction of the 35TS5H chimera, the pFBshuttleAd35 

plasmid was used as the template with the P1 and P5 primers to generate 
the 5' fragment. The 3' fragment was generated using the 
pFBshuttle(EcoR1 ) plasmid as the template with the P6 and P4 primers. 
Following the same procedure described above, the fiber shuttle plasmid 

10 pFBshuttle35TS5H was generated. 

For the 35TS5H and 5TS35H chimeras, the fiber gene was 
transferred from the pFBshuttle(EcoRI) backbone into pSQ1 as described 
above for the vector containing the Ad35 fiber. The resulting plasmids 
were called pSQ135T5H (Figure 18A) and pSQ15T35H (Figure 18B). In 

15 addition, adenoviral vectors were generated using the co-transfection 
strategy described above. 

Construction of Ad5 vectors containing the Ad35 fiber with a 
cRGD targeting peptide in the HI loop of the 35F fiber knob: To 
incorporate the cRGD targeting peptide into the Ad35 fiber HI loop, the 

20 P7 and P8 oligonucleotide primers encoding the ten amino acid sequence 
HCDCRGDCFC (SEQ ID NO. 78) were synthesized. The pFBshuttleAd35 
plasmid containing the full length Ad35 fiber cDNA was used as the 
template in the PCR reaction with the P1 and P7 primer pair or with the 
P4 and P8 primer pair in order to generate the 5' and 3' PCR fragments. 

25 A second PCR was then carried out with the end primers PI and P4 to 
join the two fragments together. The resulting PCR fragment was 
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digested with Xba1 and Muni and was cloned into pFBshutt!e(EcoRl ) to 
create the fiber shuttle plasmid pFBshuttleAd35cRGD. The modified 
Ad35 fiber gene was transferred into pSQ1 using the EcoRI cloning 
strategy described above to generate pSQI Ad35FcRGD (Figure 17B). 
5 Adenoviral vector was generated using the co-transfection strategy 
described above. 

EXAMPLE 10 

In Vitro Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
Thereof 

10 The transduction efficiencies of adenoviral vectors containing the 

35F or derivatives thereof were evaluated on A549 cells. The 
transduction efficiencies were compared to that of AvInBg, an adenoviral 
vector containing the 5F fiber. The day prior to infection, cells were 
seeded into 24-well plates at a density of approximately 1 x 10 5 cells per 

15 well. Immediately prior to infection, the exact number of cells per well 
was determined by counting a representative well of cells. Each of the 
vectors, AvInBg, Av1nBg35F, Av1nBg5T35H and AvlnBg35T5H were 
used to transduce A549 cells from 0 up to 1 ,000 particle per cell (PPC) 
ratios. The cell monolayers were stained with X-gal 24 hours after 

20 infection and the percentage of cells expressing £-galactosidase was 
determined by microscopic observation and counting of cells. 
Transductions were done in triplicate and three random fields in each well 
were counted, for a total of nine fields per vector. The results (Figure 19) 
showed similar transduction efficiencies on A549 cells using the 

25 Av1nBg35F and Av1nBg5T35H vectors compared to AvInBg. The 

Av1nBg35T5H showed much lower transduction efficiencies on A549 
cells compared to AvInBg as a result of the Ad35 shaft domain. The 
Ad35 shaft domain does not contain a HSP binding motif and the 
Av1nBg35T5H vector behaves similarly to the AvInBgS* vector in vitro 
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and in vivo. These studies also demonstrate that vectors containing fiber 
proteins without an HSP binding site are fully viable. 

EXAMPLE 1 1 

in Vivo Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
5 Thereof 

The objective of this study was to evaluate the in vivo 
biodistribution of adenoviral vectors containing 35F fibers and derivatives 
thereof to determine whether vectors containing these fibers ablate liver 
transduction due to their shaft regions. A positive control cohort received 

10 AvInBg and a negative control group received HBSS. Cohorts of five 

C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 
10 13 particles per kg. The animals were sacrificed approximately 72 hours 
after vector administration by carbon dioxide asphyxiation. Liver, heart, 
lung, spleen, and kidney were collected from each animal. The median 

15 lobe of the liver was placed in neutral buffered formalin to preserve the 
sample for /J-galactosidase immunohistochemistry. In addition, tissue from 
each organ was frozen to preserve it for hexon PCR analysis to determine 
vector content. A separate sample of liver from each mouse was frozen 
to preserve it for a chemiluminescent /?-galactosidase activity assay. JS- 

20 galactosidase immunohistochemistry, hexon real-time PCR and the 
chemiluminescent /?-ga!actosidase activity assay were carried out as 
described in example 3. 

The results of the /?-galactosidase activity assay showed a dramatic 
reduction in liver transduction by vectors containing the Ad35 fiber or the 

25 35T5H derivative (Figure 20) with an approximately 4- to 24-fold 

reduction in >ff-galactosidase activity in the liver compared to the control 
vector AvInBg. These data demonstrate that shaft domains without HSP 
binding sites can effectively ablate hepatic in vivo gene transfer. In 
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particular, HSP is the major entry mechanism for liver in vivo. CAR 
binding is a minor entry pathway. 

EXAMPLE 12 

Construction Of Ad5 Vectors Containing The Ad Serotype 41 Short Fiber 
5 And Derivatives Thereof 

The human adenovirus serotype 41 contains two different fibers on 

its capsid, encoded by two adjacent genes. One fiber has a molecular 

weight of 60kDa and is approximately 31 5A in length and is termed the 

long fiber. The other fiber has a molecular weight of 40kDa and is 

10 approximately 250-j- in length and is termed the short fiber. The Ad41 
short fiber does not bind CAR and does not possess the heparin binding 
domain (KKTK) in its shaft. Therefore, this fiber provides a useful 
platform for adenoviral vector targeting. 

Construction of adenoviral vectors based on Ad5 but containing the 

15 Ad41 short fiber: A PCR SOEing strategy was used to generate a vector 
based on the Ad5 genome but containing the Ad41 short (Ad41s) fiber. 
First, PCR was used to amplify the region of pSQ1 between the Xbal site 
at bp 25,309 and the start of the fiber gene. The primer pair used for the 
PCR were P-0005/U and P-0010/L. The DNA sequence of P-0005/U was 

20 as follows: 5' C TCT AGA AAT GGA CGG AAT TAT TAG AG 3' (SEQ ID 
NO. 65). The sequence corresponds to bp 25,308 through 25,334 of 
pSQ1 and overlaps the Xbal site in that region. The DNA sequence of 
P-0010/L was as follows: 5' TTC TTT TCA T CTG CAA CAA CAT GAA 
GAT AGT G 3' (SEQ ID NO. 79). It contains a 5' extension 

25 corresponding to the first 10 bp of the Ad41s fiber gene. The remainder 
of the primer anneals to pSQ1 immediately 5' of the ATG start codon of 
the fiber gene. The PCR product was the expected size (583 bp). A 
second PCR was used to amplify the Ad41s fiber using the plasmid 
pDV60Ad41sF as a template. The primers used were P-001 1/U and 
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P-0012/L. The DNA sequence of P-001 1/U was as follows: 5' GT TGT 
TGC AG ATG AAA AGA ACC AGA ATT GAA G 3' (SEQ ID NO. 80). It 
contains a 10 bp 5' extension corresponding to the DNA sequence 
immediately 5' of the ATG start codon of the fiber gene in pSQ1 . The 
5 remainder of the primer anneals to the beginning of the Ad41s fiber gene 
in pDV60Ad41sF. The DNA sequence of P-001 2/L was as follows: 5' TG 
CAA TTG AAA AAT AAA CAC GTT GAA ACA TAA CAC AAA CGA TTC 
TTT ATT C TTC AGT TAT GTA GCA AAA TAG A 3' (SEQ ID NO. 81). It 
contains a 51 bp 5' extension corresponding to the sequence in pSQ1 

10 from the last codon of the fiber gene through the Munl site 40 bp 

downstream of the fiber gene. The remainder of the primer anneals to the 
3' end of the Ad41s fiber gene in pDV60Ad41sF. The PCR product was 
the expected size (1219 bp), The two PCR products were joined by PCR 
SOEing using primers P-0005/U and P-0009/L. The DNA sequence of 

15 P-0009/L was described above. The PCR SOEing reaction yielded the 
expected 1 782 bp product. The product was cloned into 
pCR4blunt-TOPO to yield pCR4blunt-TOPOAd41 sF. Next, 
pCR4blunt-T0P0Ad41sF was digested with Xbal and Muni and the 1773 
bp fragment containing the Ad41s fiber gene was gel purified. This 

20 fragment was ligated with the 6477 bp Xbal to Munl fragment of 

pFBshuttle(EcoRI) to generate pFBshuttleAd41 sF. The Ad41s fiber gene 
was transferred into the pSQ1 backbone as follows. First, 
pFBshuttleAd41 sF was linearized using EcoRI and this fragment was 
ligated with the 24,213 bp EcoRI fragment of pSQ1 to generate 

25 pSQ1Ad41sF (Figure 21A). Adenoviral vector containing the Ad41s fiber 
was generated using the co-transfection strategy described above. 

Construction of Ad5 adenoviral vectors containing the Ad41 short 
fiber with a cRGD targeting ligand in the HI loop: A PCR SOEing strategy 
was used to generate a construct containing the Ad41s fiber with cRGD 



WO 2004/099422 



PCT/US2004/009219 



-120- 

in the HI loop. The plasmid pFBshuttleAd41 sF was used as a template 
for the PGR amplifications. First, a 1782 bp fragment was amplified using 
primers 5FF and 41 sRGDR. The primer 5FF was described above. It 
anneals to pFBshuttleAd41 sF at the Xbal site upstream of the fiber gene. 
5 The DNA sequence of the primer 41 sRGDR was as follows: 5' AGT ACA 
AAA ACA ATC ACC ACG ACA ATC ACA GTT TAT CTC GTT GTA GAC 
GAC ACT GA 3' SEQ ID NO. 82). It contains a 30 bp 5' extension that 
encodes the cRGD targeting ligand. The remainder of the primer anneals 
to pFBshuttleAd41sF from bp 2878 through 2903. A second PCR 

10 amplified a 277bp region of pFBshuttleAd41 sF using primers 3FR and 
41sRGDF. The primer 3FR was described previously. It anneals to 
pFBshuttleAd41sF at the Muni site downstream of the fiber gene. The 
DNA sequence of 41 sRGDF was as follows: 5' TGT GAT TGT CGT GGT 
GAT TGT TTT TGT ACT AGT GGG TAT GCT TTT ACT TTT 3' (SEQ ID 

15 NO. 83). It contains a 30 bp 5' extension that encodes the cRGD 

targeting ligand and is complementary to the extension on 41sRGDR. The 
remainder of the primer anneals to pFBshuttleAd41 sF from bp 2904 
through 2924. The two PCR products were joined by PCR SOEing to 
generate a 2059 bp fragment using primers 5FF and 3FR. The product 

20 was digested with Xbal and Muni and the 1803 bp DNA fragment was 
gel purified. The fragment was ligated with the 6477 bp fragment 
resulting from digestion of pFBshuttle(EcoRI) with Xbal and Muni. The 
resulting plasmid was termed pFBshuttleAd41 sRGD. This plasmid was 
linearized by EcoRI digestion and ligated with the 24,21 3bp EcoRI 

25 fragment of pSQ1 to generate pSQ1Ad41sRGD (Figure 21 B). 
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EXAMPLE 13 

In Vivo Evaluation Of Ad5 Vectors Containing The Ad41 Short Fiber And 
Derivatives Thereof 

This example evaluates the in vivo biodistribution of adenoviral 

5 vectors containing 41 sF fibers and derivatives thereof to determine 

whether vectors containing the these fibers ablate liver transduction due 

to modified shaft regions. A positive control cohort received Av3nBg 

(see, Gorziglia et al. (1996) J. Virology 70:4173-4178) or 

Ad5./?Gal.AF/5F, and a negative control group received HBSS. 

10 Ad5./?Gal.AF/5F is a derivative of the fiberless vector Ad5.j5gal.AF (ATCC 
accession number VR2636) modified to express ADS fiber (see, e.g., 
International PCT application No. WO 01/83729). 

The Ad5./?GaI.AF vector was pseudotyped with the Ad41sF fiber 
protein and injected in vivo. Cohorts of five C57BL/6 mice received each 

15 vector via tail vein injection at a dose of 1 x 10 13 particles per kg. The 
animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, 
and kidney were collected from each animal. The median lobe of the liver 
was placed in neutral buffered formalin to preserve the sample for 

20 /?-ga!actosidase immunohistochemistry. In addition, tissue from each 
organ was frozen to preserve it for hexon PGR analysis to determine 
vector content. A separate sample of liver from each mouse was frozen 
to preserve it for a chemiluminescent jff-galactosidase activity assay. /?- 
galactosidase immunohistochemistry, hexon real-time PCR and the 

25 chemiluminescent /?-ga!actosidase activity assay was carried out as 
described in Example 3. 

The results of the hexon DNA analysis showed a dramatic 
reduction in liver transduction by vectors containing the Ad41sF fiber 
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(Figure 22) with an approximately a 5-fold reduction in liver adenoviral 
DNA content compared to either control vector. 

In the above examples, several novel adenoviral vectors were 
generated containing various fiber modifications designed to ablate the 
5 normal tropism of the vector (see Table 3). Vectors were generated in 
which the heparan sulfate binding domain in the fiber shaft was replaced 
by amino acid substitutions. This mutation, termed HSP, was also 
combined with the KOI mutation (fiber knob AB loop mutation that 
ablates CAR binding), and the PD1 mutation (penton mutation that 

10 eliminates RGD/integrin interaction). In addition, a vector containing all 
three mutations (HSP, K01, PD1) was generated. All vectors containing 
the HSP mutation, either alone or combined with other capsid 
modifications, showed dramatically reduced transduction efficiencies on 
A549 and HeLa cells. Furthermore, the same vectors showed drama- 

15 tically reduced transduction of the liver following systemic delivery to 
mice. As an alternative strategy to ablate the normal tropism of 
Ad5-based vectors, the Ad5 fiber was replaced by a fiber from a different 
adenovirus serotype which does not bind CAR and does not contain the 
heparan binding domain in the shaft. Thus, vectors were generated 

20 containing the Ad35 fiber and the Ad41 short fiber. Versions of these 
two vectors containing a cRGD targeting ligand in the HI loop of the fiber 
were also produced. Additionally, vectors containing chimeric fibers were 
generated. A vector containing the Ad35 fiber tail and shaft regions 
fused to the Ad5 fiber knob domain as well as a vector containing the 

25 Ad5 fiber tail and shaft fused to the Ad35 fiber knob domain were 

constructed. Vectors containing either the entire Ad35 or Ad41 short 
fiber showed a significant reduction in liver transduction following delivery 
to mice via the tail vein. The observation of reduced liver transduction 
using vectors containing either an HSP mutation, the Ad35 fiber, or the 
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Ad41 short fiber indicates the feasibility of detargeting adenoviral vectors 
in vivo. In vitro data with the Ad35 fiber or the Ad41 short fiber with 
cRGD (see Example 14) indicate that the virus is completely viable, that 
is, it is not damaged by the absence of an HSP binding site and is 
5 retargetable. Taken together these data suggest that these vectors 
provide a suitable platform for retargeting strategies. 

TABLE 3 

Description Of Recombinant Adenoviral Vectors Used 
To Demonstrate That Shaft Modifications Influence Tropism In Vivo 
10 Vector 



15 



Vector 


Description 


AvInBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing ^-gaiactosidase 


Ad5 Fiber derivatives: 


Av1nBgFK01 


The same as AvInBg but containing the K01 AB loop 
mutation in the fiber gene 


Av1nBgPD1 


The same as AvInBg but containing the penton PD1 mutation 
that deletes the integrin binding, RGD tripeptide 


AvInBgS* 


The same as AvInBg but containing the 4 amino acid 
substitution in the shaft referred to as S* that modifies the 
HSP binding motif 


AvInBgFKOIS* 


The same as AvInBg but containing the fiber KOI and S* 
mutations combined 


Av1nBgS*PD1 


The same as AvInBg but containing the fiber S* and penton 
PD1 mutations combined 


Av1nBgFK01S*PD1 


The same as AvInBg but containing the fiber K01, S* and 
penton PD1 mutations combined 


Ad35 fiber derivatives: 


Av1nBg35F 


The same as AvInBg but containing the full length Ad35 fiber 
cDNA 


Av1nBg5T35H 


The same as AvInBg but containing the 5T35H chimeric fiber 


Av1nBg35T5H 


The same as AvInBg but containing the 35T5H chimeric fiber 


Av1nBg35FRGD 


The same as AvInBg but containing the full length Ad35 fiber 
cDNA with a cRGD ligand in the HI loop of the Ad35 fiber 


Ad41sF fiber derivatives: 
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Vector 


Description 


Av1nBg41sF 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA 


Av1 nBg41 sFRGD 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA with a cRGD ligand in the HI loop of the Ad41 
short fiber 



EXAMPLE 14 

5 In Vitro Evaluation Of Adenoviral Vectors Containing The Ad41sF With A 
cRGD Ligand In The HI Loop 

The transduction efficiencies of adenoviral vectors containing the 

Ad41sF fiber with the cRGD ligand in the HI loop were evaluated on 

A549 cells. The transduction efficiencies were compared to that of 

10 AvInBg, an adenoviral vector containing wild type fiber or 

Av1 nBgFKOl RGD, an adenoviral vector containing the K01 mutation in 
combination with the cRGD ligand in the HI loop. The day prior to 
infection, cells were seeded into 24-well plates at a density of 
approximately 1 x 10 5 cells per well. Immediately prior to infection, the 

15 exact number of cells per well was determined by counting a 
representative well of cells. Each of the vectors, AvInBg, 
Av1 nBgFKOl RGD, and Av1 nBg41 sFRGD were used to transduce A549 
cells at a particle to cell ratios of 0 up to 10,000. The cell monolayers 
were stained with X-gal 24 hours after infection and the percentage of 

20 cells expressing /?~galactosidase was determined by microscopic 

observation and counting of cells. Transductions were done in triplicate 
and three random fields in each well were counted, for a total of nine 
fields per vector. The results (Figure 23) show that the Av1 nBg41 sFRGD 
vector transduced cells to an equivalent level as Av1 nBgFKOl RGD at all 

25 vector doses examined. Neither FK01 or Ad41sF can bind CAR. The 
Ad41sF does not normally interact with CAR and additionally does not 
contain the HSP binding motif within the shaft domain. These data show 
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that targeting peptides inserted into the loop regions of the fiber knob of 
K01 and Ad41sF allows for transduction of target cells via the targeted 
receptor. Surprisingly, HSP, not CAR and integrins, is the major entry 
route in vivo and ablation of HSP binding permits targeting of adenoviral 
5 vectors. 

EXAMPLE 15 

Effect of the shaft modification on the biodistribution of adenoviral 
vectors in vivo 

The influence of fiber and penton modifications on the in vivo 
10 biodistribution of adenoviral vectors containing fiber head, shaft and 
penton mutations was examined. Vectors containing the HSP mutation 
combined with K01, or PD1, or a combination of all three mutations were 
evaluated as well as vectors containing the K01 mutation alone and the 
PD1 mutation alone. The indicated adenoviral vectors were systemically 
15 administered to C57BL6 mice as described above. A positive control 
cohort received AvInBg and a negative control group received HBSS. 
Cohorts of five C57BL/6 mice received each vector via tail vein injection 
at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 
approximately 72 hours after vector administration by carbon dioxide 
20 asphyxiation. Liver, heart, lung, spleen, and kidney were collected from 
each animal. Tissue from each organ was frozen to preserve it for real 
time PCR analysis to determine adenoviral hexon DNA content. A 
separate sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent /?-galactosidase activity assay. Hexon real-time PCR 
25 and the chemiluminescent ^-galactosidase activity assay was carried out 
as described in Example 3. 

The results derived from the liver are described in Example 6 
(Figure 14A and B) and also shown in Figure 26 with results presented as 
percent control of AvInBg. The effect of the S* shaft modification on 
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the biodistribution of adenovirus to the other organs is shown in Figure 
25. The average adenoviral DNA content was determined as adenoviral 
genomic copies per cell and expressed as a percentage of the AvInBg 
( + ) control value. The average percent control value + standard 
5 deviation is shown (n = 5 per group) for each tissue examined (Figure 25). 

Systemic delivery of Ad5 based vectors with wild-type fiber results 
in a preferential accumulation of vector DNA in the liver with 64 copies 
per cell with significantly less DNA found in the other organs with 1 .32 
copies per cell found in lung, 2.18 copies per cell in spleen, 0.47 copies 

10 per cell found in heart, and 0.72 copies per cell in the kidney. All 
differences found with PD1, S*, K01PD1, KQ1S*, S*PD1, and 
K01S*PD1 were significantly different than the AvInBg ( + ) control using 
a unpaired, t-test analysis, P value ( 0.024. When expressed as a percent 
of the AvInBg control values, the influence of each mutation, individually 

15 or in combination, becomes apparent. The S* mutation dramatically 

reduced gene transfer to all four organs, whereas, the K01 mutation did 
not. Thus, the importance of the shaft for transduction in vivo extends to 
organs besides the liver. Finally, gene transfer to the lung, heart, and 
kidney was diminished with PD1 suggesting a role for integrin binding in 

20 vector entry in these organs. 

EXAMPLE 16 

Retargeting the S*, shaft modification and the 41sF fiber in vivo 

Vectors containing the HSP mutation have been shown to 
effectively detarget adenoviral vectors in vivo (see examples 6 and 15). 
25 The objective of this study was to evaluate the ability to retarget vectors 
containing the S* modification or the Ad41sF to tumors in vivo. A cRGD 
peptide was genetically incorporated into the fiber HI loop and evaluated 
in vitro (Examples 8 and 14). These same vectors were then evaluated in 
vivo in tumor-bearing mice. Athymic nu/nu female mice were injected 
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with 8 x 10 6 A549 cells on the right hind flank. When tumors reached 
approximately 1 00mm3 in size, they were randomized into treatment 
groups. Cohorts of 6 mice received each vector via tail vein injection at a 
dose of 1 x 10 13 particles per kg. The animals were sacrificed 
5 approximately 72 hours after vector administration by carbon dioxide 

asphyxiation. Tumor, liver, heart, lung, spleen, and kidney were collected 
from each animal. Tissue from each organ was frozen to preserve it for 
real time PCR analysis to determine adenoviral hexon DNA content. 
Hexon real-time PCR was carried out as described in example 3. A 

10 separate sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent /?-galactosidase activity assay. Hexon real-time PCR 
and the chemiluminescent /?-galactosidase activity assay was carried out 
as described in example 3. 

The adenoviral vector biodistribution to the liver and tumor for each 

15 treatment group is shown in Figure 27. Vectors containing the S*, 
KQ1S*, and 41 sF fibers effectively detargeted the liver and tumor 
resulting in a significant reduction in the amount of adenoviral DNA found 
in each tissue in comparison to the AvInBg control. Vectors containing 
the cRGD targeting ligand restored tranduction of the tumors to levels 

20 comparable to that achieved with the untargeted vector. 

These data demonstrate successful liver detargeting accompanied 
with tumor retargeting. The extent of tumor retargeting is relates to the 
affinity and type of ligand that is used. These data demonstrate the 
successful development of a targeted, systemically deliverable adenoviral 

25 vector that will target tumors in vivo. 
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EXAMPLE 17 

Scale-Up Method For The Propagation Of Detargeted Adenoviral Vectors 

The growth and propagation of doubly or triply ablated adenoviral 
vectors requires novel scale up technologies. These detargeted vectors 
5 require alternative cellular entry strategies to allow for the efficient 

growth and generation of high titer preparations. A strategy for vector 
growth that is generally applicable to all detargeted adenoviral vectors, 
that does not require the development of new cell lines, and that aslo can 
be used for generating targeted vectors is provided herein. 

10 Three recombinant adenoviral vectors were prepared that contain 

single mutations in the fiber or penton or both mutations combined into 
one vector. These vectors are designated Av3nBgFK01, Av1nBgPD1, and 
Av1 nBgFKOI PD1 , respectively. The construction of these vectors is 
described above and a general description of each vector can be found in 

15 Table 1 above. 

Scale-up of detargeted adenoviral vectors: A polycation, specifically 
hexadimethrine bromide was obtained from Sigma Chemical Co (St. Louis, 
MO), Catalog No. 52495, and was maintained in the medium at 4 //g/ml 
during the course of transfections and infections. To illustrate the affects 

20 of hexadimethrine bromide on the yield of detargeted adenoviral vectors 
the following experiment was carried out. Seven plates of AE1-2a 
adenoviral producer cells (Gorziglia eta/. (1996) J. Virology 
70:4173-4178) were transduced with 10 particles per cells of each of the 
indicated vectors (See Table 4). Each vector was incubated with medium 

25 (Richters with 2% HI-FBS) containing hexadimethrine bromide at 4 //g/ml 
for 30 min at room temperature prior to infection. The infection was 
carried out for 2 hrs. Complete medium containing hexadimethrine 
bromide at 4 //g/ml was added to each plate. Final concentration of 
hexadimethrine bromide in all of these experiments was maintained at 
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4 //g/ml. The titers were determined spectrophotometrically using the 
conversion of 10D at A260nm per 1 x 10 12 particles (Mittereder et al. 
(1996) J Virology 70:7498-7509). The total particle yield was then 
normalized for the number of plates used for transduction. 
5 The inclusion of hexadimethrine bromide in the medium during the 

course of infection allows for the efficient propagation of detargeted 
adenoviral vectors containing fiber and penton mutations either alone or in 
combination. The effect of hexadimethrine bromide on vector yields is 
shown in Table 4. A 35-fold improvement in the yield of AvSnBgFKOI 

10 was found when hexadimethrine bromide was included in the culture 
medium and resulted in increasing the vector yield from 1 .3 x 10 10 up to 
4.6 x 10 11 vector particle per plate. Hexadimethrine bromide has a 
minimal effect on the yield of the Av1nBgPD1 adenoviral vector 
containing the penton, PD1 mutation with only a 1.2 fold improvement. 

15 The greatest effect using hexadimethrine bromide was found on the 
propagation of the doubly ablated adenoviral vector, Av1 nBgFKOI PD1 
with increases in vector yield from barely detectable levels up to 4.53 x 
10 10 vector particles per plate. These data demonstrate that use of 
nonspecific entry mechanisms allows for the efficient scale-up of 

20 detargeted adenoviral vectors. 



TABLE 4 

Efficient Scale-Up Of Detargeted Adenoviral Vectors Using 
hexadimethrine bromide 



Vector 


Vector Yield (particles/plate) 


Fold 
Improvement 


(-) hexadimethrine 
bromide 


( + ) hexadimethrine 
bromide 


AvInBg 


3.89 x 10 11 


5.72 x 10 11 


1.47 


Av3nBg 


8.58 x 10 10 


2.38 x 10 11 


2.77 


Av3nBgFK01 


1.30 x 10 10 


4.60 x 10 11 


35.4 


Av1nBgPD1 


1.95 x 10 11 


2.40 x 10 11 


1.23 
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Vector 


Vector Yield (particles/plate) 


Fold 
Improvement 


{-) hexadimethrine 
bromide 


{ + ) hexadimethrine 
bromide 


Av1 nBgFKOI PD1 


TLTC* 


4.53 x 10 10 


T 



*TLTC: Too low to count, a faint virus band was collected and the particle 
concentration was too dilute for titer determination. 
5 f Significant improvement 

The use of alternative polycations including protamine sulfate and 

poly-lysine as well as Afunctional proteins such as the anti-penton:TNFa 

fusion protein was investigated. Figure 24 show results that demonstrate 

all the reagents tested had some effect on enhancing transduction of the 

10 Av3nBgFK01 vector. All of these compounds, when maintained in the 
medium during infection, enhanced transduction of the Av3nBgFK01 
detargeted adenoviral vector. 

Bifunctional reagents:. The use of bifunctional reagents for the 
propagation of detargeted adenoviral vectors was examined using the 

15 anti-penton:TNFa fusion protein. This particular reagent is a fusion protein 
between an antibody against Ad5 penton and the TNFa protein that is 
produced using stably transfected insect cells. This reagent will bind 
specifically to the adenoviral capsid via penton base and allow for binding 
to cell surface TNF receptors. The use of this reagent for the propagation 

20 of detargeted vectors is illustrated in Table 5 using Av3nBgFK01 (also 
shown in Figure 24). Monolayers of S8 cells were infected with 10 or 
100 particles per cell of Av3nBgFK01 or a control vector in the presence 
or absence of 1 //g/ml of the anti-penton:TNFa fusion protein. The 
monolayers were visually inspected over time for vector spread as 

25 indicated by the extent of cytopathic effect (CPE). The percentage of 
CPE at each time point is shown. The use of this bifunctional reagent 
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clearly enhances the spread of the Av3nBgFK01 vector throughout the 
monolayer. 

TABLE 5 

Efficient Scale-Up Of Detargeted Adenoviral 
5 Vectors Using Bifunctional Reagents: Anti-Penton:TNFcr 





10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


1 00 ppc + 
anti-penton TNF 




Percentage of CPE 


AdBLud 


24 h 


0% 


0% 


0% 


0% 


48 h 


20-30% 


20-30% 


90-100% 


90-100% 


72 h 


60-70% 


80-90% 


100% 


100% 


120 h 


100% 


100% 


100% 


100% 


Av3nBgK01 24hrs 


24 h 


0% 


0% 


0% 


0% 


48 h 


0% 


10-20% 


0% 


90-100% 


72 h 


5% 


60-70% 


5% 


100% 


120 h 


40-50% 


100% 


100% 


100% 



20 EXAMPLE 18 

This Example and the following Example describe construction of 

adenoviral Ad5 particles that express heterologous fibers. The fibers are 

modified at the N-terminus to increase incorporation into the Ad5 particle. 

The N-terminus, typically, the first at least 1*6 or 17 amino acids is 

25 modified so that the sequence resembles the Ad5 terminus. 

Expression of Fiber Proteins from Subgroups B, C and D of Human 
Adenovirus 

Constructs for expression of fibers from several adenoviral 
30 serotypes including, Types 16, 30, and 35 (subgroup B), and Types 1 9p 
and 37 (subgroup D), were generated. 
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Construction of Ad37, Ad19p and Ad30 fiber expression plasmids 

The open reading frames (ORFs) of Ad37 (SEQ ID NO. 31), Ad19p 
(SEQ ID NO. 33) or Ad30 (SEQ ID NO. 35) fiber proteins were PCR 
amplified using the following primers: 
5 Forward primer (L37): TGT CTT GGA TCC AAG ATG AAG CGC 

GCC CGC CCC AGC GAA GAT GAC TTC (SEQ ID NO. 84) 

Reverse primer (37FR): AAA CAC GGC GGC CGC TCT TTC ATT 

CTT G (SEQ ID NO. 85) 

Primers L37 and 37FR include BamHI and Notl sites (in bold), 

10 respectively, to facilitate subcloning. In addition, primer L37 introduces 
mutations into the 5' end of each fiber protein so that the resulting fiber 
proteins more closely resemble the Ad5 fiber N-terminal sequence for 
assembly onto Ad5 particles. For Ad37, the native N-terminus and 
modified N-terminus have the following amino acid sequences: 

15 Native Ad37 N-terminus: MSKRLRVE (SEQ ID NO. 86) 

Modified Ad37 N-terminus: MKRARPSE (SEQ ID NO. 87) 
The amplified fibers were cloned into the BamHI and Notl sites of 
plasmid pCDNA3.1zeo( + ) (Invitrogen). The Ad5 tripartite leader (TPL) 
sequence from plasmid pDV55 (see EXAMPLE 20 and SEQ ID NO. 88), 

20 which is flanked by BamHI sites (SEQ ID NO. 88), was then subcloned 

into the BamHI site of each fiber expression plasmid to generate plasmids 
pDV121 (Ad37), pDV145 (Ad19p) and pDV164 (Ad30). Construction of 
plasmid pDV55 is set forth in EXAMPLE 20 (see also, copending U.S. 
application Serial No. 09/482,682, also filed as International PCT 

25 application No. PCT/US00/00265; and in U.S. application Serial No. 
09/562,934, also filed as International PCT application No. 
PCT/EP01/04863. The combination of the CMV promoter present in 
pCDNA3.1zeo( + ) and the addition of the TPL sequence from pDV55 
(SEQ ID NO. 88) provides for high-level expression of viral proteins. 
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Construction of Ad 16 and Ad35 fiber expression plasmids 

Ad 16 and Ad35 fiber expression plasmids were generated in a 
similar manner with the following modifications. To PCR amplify Ad16 
(SEQ ID NO. 37) and Ad35 (SEQ ID NO. 39) fiber, the forward primer 
5 was designed to incorporate an Ndel site (in bold), which is present at 
nucleotide 48 of Ad5 fiber (SEQ ID NO. 1), but absent in Ad16 and Ad35 
fiber sequences. The reverse primers contained a Notl site (in bold): 

Ad16/Ad35 forward primer (F16 5'): CCG GTC TAC CCA TAT 

GAA GATG (SEQ ID NO. 89) 
10 Ad16 reverse primer (F16 3'): TGG TGC GGC CGC TCA GTC ATC 

TTC TCTG (SEQ ID NO. 90) 

Ad35 reverse primer (F35 3'): TGG TGC GGC CGC TTA GTT GTC 
GTC TTC TGT AAT G (SEQ ID NO. 91) 

The Ad16 and Ad35 PCR products were cloned into the Ndel and 
15 Notl sites of pCDNA3.1zeo( + ), resulting in plasmids pDV147 (Ad16) and 
pDV165 (Ad35). The Ndel site of pCDNA3.1zeo( + ) is within the CMV 
promoter region of the plasmid, therefore, the resulting plasmids lacked 
the 3' portion of the CMV promoter region. In addition, the inserted fiber 
sequences were lacking the portion of the fiber sequence that is 5' to the 
20 engineered Ndel site. To insert the necessary regulatory sequences and 
N-terminal fiber sequence, plasmid pDV67 (described in Example 22 and 
also U.S. Application Serial No. 09/562,934, and is available from the 
ATCC under accession number PTA-1 145) was digested with Ndel to 
remove a fragment that contains the 3' portion of the CMV promoter, the 
25 complete Ad5 TPL sequence and the 5' portion of the Ad5 fiber 

sequence. The Ndel fragment was subcloned into plasmids pDV147 and 
pDV165 to generate the complete Ad 16 and Ad35 expression plasmids, 
pDV156 and pDV166, respectively. Expression of these constructs 
results in chimeric fiber proteins containing the 17 N-terminal amino acids 
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from Ad5 fiber (see SEQ ID NO. 2) and the remainder of the fiber 
sequence from either Ad1 6 or Ad35. The nucleotide sequences of the 
chimeric fibers are listed in SEQ ID NO. 41 (Ad5/Ad16) and SEQ ID NO. 
43 (Ad5/Ad35). 

5 Expression and trimerization of recombinant Ad fiber proteins 

To verify expression and trimerization of the recombinant proteins, 
the resulting plasmids were transfected into 293T cells, which are 
identical to 293 cells except they express an integrated SV40 large T 
antigen gene. 239 cells are an adenovirus-transformed human embryonic 

10 kidney cell line obtained from the ATCC, where they are deposited under 
Accession Number CRL 1573. 293T cells express CAR and a v integrins. 
Fiber expression was detected by immunoblotting of cell lysates using the 
4D2 monoclonal antibody (Research Diagnostics Inc., Flanders, N.J.), 
which recognizes an epitope conserved among fibers of different 

15 serotypes. To generate stable cell lines, constructs were electroporated 
into an A549-derived cell line that complements the Ad E1a and E2a 
functions (Gorziglia et al., J. Virol. 70:4173-4178 (1996)) and stable 
clones were derived by selection with zeocin. Clones that expressed high 
levels of the fiber protein were identified by immunoblotting with the 4D2 

20 antibody. 

Generation of Adenovirus Particles Pseudotyped with Subgroup B, 
C or D Ad Fiber Protein 

A system for producing Ad vector particles with different or 
25 modified fiber proteins, that therefore have altered tropism (pseudotyping) 
is known (see, e.g., Von Seggern et al., J. Virol. 74:354-362 (2000); Wu 
et al., Virology 275:78-89 (2001)). Briefly, an E1 -deleted Ad vector is 
modified by further deletion of the fiber gene, such that the virus 
produces no fiber protein. Growth of the fiber-deleted viruses in 
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packaging cells that express a fiber protein as well as complementing the 
E1 deletion allows generation of particles with any desired fiber. 

Packaging cell lines were generated by stably transfecting 
expression constructs for the fibers of interest (Von Seggern et al., J. 
5 Virol. 74:354-362 (2000)) into an A549-derived E1- and E2a- 

complementing cell line (Gorziglia et al., J. Virol. 70:4173-4178 (1996)), 
and clones that expressed the fibers at high levels were selected. The 
resulting lines complement E1 and fiber deletions, and were used to 
propagate Ad5.GFP.AF, a fiber-deleted Ad5 vector with a GFP transgene 

10 in place of the deleted E1 sequences (Von Seggern et al., J. Virol. 

74:354-362 (2000)). The particles produced by growth in the various 
cell lines are identical except for their fiber proteins. Viral particles were 
isolated by CsCI gradient centrifugation, and assayed for the presence of 
fiber by immunoblotting using monoclonal Ab 4D2. As a control for equal 

15 loading, the blot was re-probed with a polyclonal antibody against the Ad 
penton base protein. All recombinant fibers were capable of assembly 
onto Ad5 particles. 

EXAMPLE 19 

Construction and Propagation of Adenovirus Particles with Genomic Fiber 
20 Substitutions 

The fiber-deletion system described in Example 18 allows rapid 
evaluation of fiber proteins for their infectious properties. The resulting 
particles produced are less infectious than the corresponding first- 

25 generation vectors (Von Seggern et al., J. Virol. 74:354-362 (2000)). 

Therefore, viral backbones with the subgroup B, C or D fibers substituted 
in place of the Ad5 fiber gene were constructed. To facilitate 
construction of these vectors, the AdEasy system (see, U.S. Patent No. 
5,922,576; see, also He et al. (1998) Proc. Natl. Acad. Sci. U.S.A. 

30 55:2509-2514; the system is publicly available from the authors and 
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other sources) was modified. This system includes a large plasmid 
(pAdEasy) that contains most of the Ad5 genome and smaller shuttle 
plasmids with the left end of the viral genome, including an E1 deletion 
and polylinker for insertion of transgenes. Recombination between 
5 pAdEasy and a shuttle plasmid in E. coli reconstitutes a full-length 

infectious Ad genome. All plasmids used were derivatives of pAdEasyl 
with different fiber proteins substituted in place of the Ad5 fiber. 
Construction of pDV153 

p5FloxHRF (SEQ ID NO. 92) contains the right end of the Ad5 
10 genome with a Pad site in place of the right ITR. There is a unique 
(naturally occurring in Ad5) Muni site approximately 30 nucleotides 
downstream of the fiber stop codon. To facilitate fiber substitutions, this 
site was moved to lie immediately downstream of the fiber ORF. This 
preserved the sequence around the fiber gene and its spacing relative to 
15 the other adenovirus genes. 

The oligos MunlTOP (AAT TGT GTT ATG TTT AAA CGT GTT TAT 
TTT TG; SEQ ID NO. 93) and Muni BOTTOM (AAT TCA AAA ATA AAC 
ACG TTT AAA CAT AAC AC; SEQ ID NO. 94) were annealed 
and ligated into the unique Muni site of p5FloxHRF to generate plasmid 
20 pDV153. Insertion of the oligo destroyed the original Muni site by 

changing one base at its 5' end, but resulted in insertion of a new Muni 
site that is 32 base pairs closer to the fiber ORF than the original Muni 
site. 

Construction of an Ad vector with a chimeric Ad5/Ad37 fiber gene 

25 To replace the Ad5 fiber sequence of pDV153 with Ad37 fiber, 

pDV153 was digested with Sphl and Muni. This removed all but the N- 
terminal 183 nucleotides of Ad5 fiber (see SEQ ID NO. 1). Ad37 fiber 
was then PGR amplified using a 5' primer (F37 5'Sphl) TAC CAA TGG 
CAT GCT ATC CCT CAA GG (SEQ ID NO. 95) that added a Sphl site and 
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a 3' primer (F37 3'EcoRI) AAA CAC GGG AAT TCG TCT TTC ATT C 
(SEQ ID NO. 96) that added an EcoRI restriction site. The 3' primer was 
designed to have an EcoRI site since the Ad37 fiber sequence contains a 
Muni restriction site. The nucleotide overhangs left by digestion with 
5 EcoRI and Muni are compatible, allowing the PCR products to be cloned 
into pDV153 digested with Sphl and Muni. This resulted in expression of 
a chimeric Ad5/Ad37 fiber protein with the N-terminal 61 amino acids 
from Ad5 fiber (SEQ ID NO. 2) and the remainder of the protein from 
Ad37 (corresponding to amino acid 62 to the end of Ad37 fiber; SEQ ID 
10 NO. 32). 

After Ad37 fiber was ligated into pDV153, the Spel/Pacl fragment 
was used to replace the Spel/Pacl fragment of pAdEasy, resulting in 
plasmid pDV158. Plasmid pDV158 was then recombined with the shuttle 
plasmid pAdTrack, which contains a CMV-driven EGFP reporter gene (He 

15 eta/., Proc. Natl. Acad. Sci. USA 55:2509-2514 (1998); U.S. Patent 
Serial No. 5,922,576). The resulting Ad vector (Ad5.GFP.37F) has the 
EGFP reporter at the site of the E1 deletion and the chimeric Ad5/Ad37 
fiber gene in the viral chromosome, and infects cells via the Ad37 
receptor rather than CAR. pDV158 can be readily used to create 

20 adenovirus particles with the same fiber protein but different transgenes. 
Propagation of Ad5.GFP.37F in 633 cells 

The Ad5.GFP.37F genome is infectious, and readily begins 
replicating as a virus. Since the 293 cells (ATCC Accession No. CRL 

25 1573) normally used for Ad propagation do not express high levels of the 
Ad37 receptor, this virus does not efficiently propagate. To facilitate viral 
amplification, stocks of the virus were maintained in the 633 cell line 
(ATCC Accession No. PTA-1 145), which expresses a wildtype Ad5 fiber 
protein (Von Seggern eta/., J. Virol. 74:354-362 (2000)). The particles 

30 therefore contain the Ad5 fiber produced by the cells and the chimeric 
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Ad5/Ad37 fiber protein encoded by the virus. The Ad5 fiber allows the 

virus to re-infect the cell lines used for viral growth. A final round of 

growth in 293 cells (which do not express a fiber protein) generates 

particles with only the vector-encoded Ad37 fiber. To assess Ad5 and 

5 Ad37 fiber content of Ad5.GFP.37F particles, viral particles produced in 

either 633 cells or 293 cells were immunoblotted with anti-fiber 

monoclonal Ab 4D2. 633-grown particles contained the Ad5 and Ad5/37 

fibers, while virus produced in 293 cells contained only the Ad5/37 

chimeric fiber. Particles of the first-generation Ad vector Ad5.yffgal.wt, 

10 which contain only the wildtype Ad5 fiber (Wu et a/., Virology 275:78-89 

(2001)), were included as a positive control. As a loading control, the 

same blot was re-probed with a polyclonal antibody against the viral 

penton base protein. 

Preparation of additional Ad5 genomes encoding heterologous 
15 fibers 

These same procedures can be used to construct Ad5 genomes 

containing the 19p (SEQ ID NO. 33), 16 (SEQ ID NO. 37), 30 (SEQ ID 

NO. 35) and 35 (SEQ ID NO. 39) fibers. To improve incorporation of the 

fiber in the resulting particle, each fiber was modified to include the N- 

20 terminal 61 amino acids of Ad5 (see SEQ ID NO. 2 or see nucleotides 1- 
183 in SEQ ID NO. 1) by replacing the corresponding amino acids (i.e., 
the first 61 amino acids) of each heterologous fiber. Similar constructs 
can be made with other heterologous fibers and genomes, such as Ad2. 
For example, for construction of the Ad5/Ad16 chimeric fiber 

25 vector, plasmid pDV153 was digested with Sphl and Muni to remove all 
but the first 183 nucleotides of Ad5 fiber. Ad 16 fiber (SEQ ID NO. 37) 
was PCR amplified using 5' primer F16 5' Sphl: GCC AGC GGC ATG CTC 
CAA CTT AAA (SEQ ID NO. 97) and 3' primer F16 3' Muni: TTT ATC 
AAT TGT GTT GTC AGT CAT CTT C (SEQ ID NO. 98), which contained 
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Sphl and Muni sites, respectively. The PCR product was ligated with 
plasmid pDV153 to generate plasmid pDV182. This resulted in 
expression of a chimeric Ad5/Ad16 fiber protein with the N-terminal 61 
amino acids from Ad5 fiber (SEQ ID NO. 2) and the remainder of the 
5 protein from Ad 16 (corresponding to amino acid 62 to the end of Ad 16 
fiber; SEQ ID NO. 38). 

EXAMPLE 20 

Tripartite leader sequences (TPLs) that are useful in enhancing the 
expression of complementing adenoviral proteins, particularly fiber 

10 protein, for use in preparing an adenoviral gene delivery vector are 

provided. The complete Ad5 TPL was constructed by assembling PCR 
fragments. First, the third TPL exon (exon 3) (nt 9644-9731 of the Ad5 
genome) was amplified from Ad5 genomic DNA using the synthetic 
oligonucleotide primers 5'CTCAACAATTGTGGATCCGTACTCC3' (SEQ ID 

15 NO. 99) and 5'GTGCTCAGCAGATCTTGCGACTGTG3' (SEQ ID NO. 100). 
The resulting product was cloned to the BamHI and Bglll sites of 
pAE1Sp1a (Microbix Biosystems; see also, U.S. Patent No. 6,140,087 
and U.S. Patent No. 6,379,943) using sites in the primers (shown in bold) 
to create plasmid pDV52. A fragment corresponding to the first TPL exon 

20 (exon 1), the natural first intron (intron 1), and the second TPL exon 
(exon 2) (Ad5 nt 6049-7182) was then amplified using primers 
5'GGCGCGTTCGGATCCACTCTCTTCC3' (SEQ ID NO. 101) and 
5'CTACATGCTAGGCAGATCTCGTTCGGAG3' (SEQ ID NO. 102), and 
cloned into the BamHI site of pDV52 (again using sites in the primers) to 

25 create pDV55. 

This plasmid contains a 1.2 kb BamHI/Bglll fragment containing the 
first TPL exon, the natural first intron, and the fused second and third TPL 
exons. The nucleotide sequence of the complete TPL containing the 
noted 5' and 3' restriction sites is shown in SEQ ID NO. 103 with the 
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following nucleotide regions identified: 1-6 nt BamHI site; 7-47 nt first 
leader segment (exon 1); 48-1068 nt natural first intron (intron 1); 1069- 
1 140 nt second leader segment (exon 2); 1 141-1 146 nt fused BamHI and 
Bglll sites; 1 147-1234 nt third leader segment (exon 3); and 1235-1240 
5 nt Bglll site. 

EXAMPLE 21 

Preparation of Adenoviral Gene Delivery Vectors Using Adenoviral 
Packaging Cell Lines 

Adenoviral delivery vectors are prepared to separately lack the 
10 combinations of E1 /fiber and E4/fiber. Such vectors are more replication- 
defective than those previously in use due to the absence of multiple viral 
genes. A preferred adenoviral delivery vector is replication competent but 
only via a non-fiber means is one that only lacks the fiber gene but 
contains the remaining functional adenoviral regulatory and structural 
15 genes. Furthermore, these adenovirus delivery vectors have a higher 
capacity for insertion of foreign DNA. 

A. Preparation of Adenoviral Gene Delivery Vectors Having Specific 
Gene Deletions and Methods of Use 

To construct an E1 /fiber deleted viral vector containing the LacZ 
20 reporter gene construct, two new plasmids were constructed. The 
plasmid pAE1B/?gaI was constructed as follows. A DNA fragment 
containing the SV40 regulatory sequences and £. coli /?-galactosidase 
gene was isolated from pSVyffgal (Promega) by digesting with Vspl, filling 
the overhanging ends by treatment with Klenow fragment of DNA 
25 polymerase I in the presence of dNTPs and digesting with BamHI. The 
resulting fragment was cloned into the EcoRV and BamHI sites in the 
polylinker of pAE1sp1B (Microbix Biosystems; see also, U.S. Patent No. 
6,140,087 and U.S. Patent No. 6,379,943) to form pAE1B£gal that 
therefore contained the left end of the adenovirus genome with the Ela 
30 region replaced by the LacZ cassette (nucleotides 6690 to 4151) of 
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pSV/?gal. Plasmid DNA may be prepared by the alkaline lysis method as 
described by Birnboim and Doly, Nuc. Acids Res., 7:1513-1523 (1978) or 
by the Quiagen method according to the manufacturer's instruction, from 
transformed cells used to expand the plasmid DNA. Plasmid DNA was 
5 then purified by CsCI-ethidium bromide density gradient centrifugation. 
Alternatively, plasmid DNAs may be purified from E. coli by standard 
methods known in the art (e.g. see Sambrook eta/.) 

The second plasmid (pDV44), prepared as described herein, is 
derived from pBHGIO, a vector prepared as described by Bett et al., Proc. 
10 Natl. Acad. Sci., USA, 91:8802-8806 (1994) (see, also International PCT 
application No. WO 95/00655) using methods well known to one of skill 
in the art. This vector also is commercially available from Microbix 
Biosystems and contains an Ad5 genome with the packaging signals at 
the left end deleted and the E3 region (nucleotides 28133:30818) 
15 replaced by a linker with a unique site for the restriction enzyme Pad. An 
1 1 .9 kb BamHI fragment, which contains the right end of the adenovirus 
genome, is isolated from pBHGIO and cloned into the BamHI site of 
pBS/SK( + ) to create plasmid p1 1.3 having approximately 14,658 bp. 
The p1 1 .3 plasmid was then digested with Pad and Sail to remove the 
20 fiber, E4, and inverted terminal repeat (ITR) sequences. 

This fragment was replaced with a 3.4 kb fragment containing the 
ITR segments and the E4 gene which was generated by PCR amplification 
from pBHGIO using the following oligonucleotide sequences: 
5' TGTACACCG GATCCGGCGCACACC3' SEQ ID NO: 104; and 
25 5'CACAACGAGCTC AATTAATTAATTGCCACATCCTC3' SEQ ID 

NO: 105. These primers incorporated sites for Pad and BamHI. Cloning 
this fragment into the Pad and blunt ended Sail sites of the p1 1 .3 
backbone resulted in a substitution of the fused ITRs, E4 region and fiber 
gene present in pBHGIO, by the ITRs and E4 region alone. The resulting 
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p1 1 .3 plasmid containing the ITR and E4 regions, designated plasmid 
pDV43a, was then digested with BamHI. This BamHI fragment was then 
used to replace a BamHI fragment in pBHGIO thereby creating pDV44 in 
a pBHGIO backbone. 
5 In an alternative approach to preparing pDV44 with an additional 

subcloning step to facilitate the incorporation of restriction cloning sites, 
the following cloning procedure was performed. pDV44 as above was 
constructed by removing the fiber gene and some of the residual E3 
sequences from pBHGIO (Microbix Biosystems; see, also U.S. Patent No. 

10 6,140,087). As above, to simplify manipulations, the 1 1 .9 kb BamHI 
fragment including the rightmost part of the Ad5 genome was removed 
from pBHGIO and inserted into pBS/SK. The resulting plasmid was 
termed p1 1 .3. The 3.4 kb DNA fragment corresponding to the E4 region 
and both ITRs of adenovirus type 5 was amplified as described above 

15 from pBHGIO using the oligonucleotides listed above and subcloned into 
the vector pCR2.1 (Invitrogen) to create pDV42. This step is the 
additional cloning step to facilitate the incorporation of a Sail restriction 
site. pDV42 was then digested with Pad, which cuts at a unique site 
(bold type) in one of the PCR primers, and with Sail, which cuts at a 

20 unique site in the pCR2.1 polylinker. This fragment was used to replace 
the corresponding Pacl/Xhol fragment of p1 1.3 (the pBS polylinker 
adjacent to the Ad DNA fragment contains a unique Xhol site), creating 
pDV43. A plasmid designated pDV44 was constructed by replacing the 
1 1 .9 kb BamHI fragment of pBHG1 0 by the analogous BamHI fragment of 

25 pDV43. As generated in the first procedure, pDV44 therefore differs 

from pBHGIO by the deletion of Ad5 nucleotides 30819:32743 (residual 
E3 sequences and all but the 3'-most 41 nucleotides of the fiber open 
reading frame). 
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ln summary, the cloning procedures described above result in the 
production of a fiber-deleted Ad5 genomic plasmid (pDV44) that was 
constructed by removing the fiber gene and some of the residual E3 
sequences from pBHGIO. pDV44 contains a wild-type E4 region, but 
5 only the last 41 nucleotides of the fiber ORF (this sequence was retained 
to avoid affecting expression of the adjacent E4 transcription unit). 
Plasmids pBHGIO and pDV44 contain unpackageable Ad5 genomes, and 
must be rescued by cotransfection and subsequent homologous 
recombination with DNA carrying functional packaging signals. In order 

10 to generate vectors marked with a reporter gene, either pDV44 or 

pBHGIO was cotransfected with pAE1 BBgal, which contains the left end 
of the Ad5 genome with an SV40-driven B-galactosidase reporter gene 
inserted in place of the El region. 

In general, and as described below, the method for virus production 

15 by recombination of plasmids followed by complementation in cell culture 
involves the isolation of recombinant viruses by cotransfection of any 
adenovirus packaging cell system, namely 21 1 A, 211 B, 21 1 R, A549, 
Vero cells, and the like, with plasmids carrying sequences corresponding 
to viral gene delivery vectors. 

20 *A selected cell line is plated in dishes and cotransfected with 

pDV44 and pAE1B£ gal using the calcium phosphate method as described 
by Bett et aL, Proc. Natl. Acad. Sci., USA, 37:8802-8806 (1994). 
Recombination between the overlapping adenovirus sequences in the two 
plasmids leads to the creation of a full-length viral chromosome where 

25 pDV44 and pAE1B/? gal recombine to form a recombinant adenovirus 

vector having multiple deletions. The deletion of E1 and of the fiber gene 
from the viral chromosome is compensated for by the sequences 
integrated into the packaging cell genome, and infectious virus particles 
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are produced. The plaques thus generated are isolated and stocks of the 
recombinant virus are produced by standard methods. 

Because of the fiber deletion, a pDV44-derived virus is replication- 
defective, and cells in which it is grown must complement this defect. 
5 The 21 IB cell line (a derivative of 293 cells which expresses the wild- 
type (wt) AD5 fiber and is equivalent to 21 1 A on deposit with ATCC) 
was used for rescue and propagation of the virus described here. pDV44 
and pAEIfcgal were cotransfected into 21 1B cells, and the monolayers 
were observed for evidence of cytopathic effect (CPE). Briefly, for virus 

10 construction, cells were transfected with the indicated plasmids using the 
Gibco Calcium Phosphate Transfection system according to the 
manufacturer's instructions and observed daily for evidence of CPE. 

One of a total of 58 transfected dishes showed evidence of 
spreading cell death at day 15. A crude freeze-thaw lysate was prepared 

15 from these cells and the resulting virus (termed Ad5.£gal.AF) was plaque 
purified twice and then expanded. To prepare purified viral preparations, 
cells were infected with the indicated Ad and observed for completion of 
CPE. Briefly, at day zero, 21 1B cells were plated in DMEM plus 10% 
fetal calf serum at approximately 1 X 10 7 cells/150 cm 2 flask or 

20 equivalent density. At day one, the medium was replaced with one half 
the original volume of fresh DMEM containing the indicated Ad, in this 
case Ad5.ISgal.AF, at approximately 100 particles/cell. At day two, an 
equal volume of medium was added to each flask and the cells were 
observed for CPE. Two to five days after infection, cells were collected 

25 and virus isolated by lysis via four rapid freeze-thaw cycles. Virus was 
then purified by centrifugation on preformed 15-40% CsCI gradients 
(1 1 1 ,000 x g for three hours at 4°C). The bands were harvested, 
dialyzed into storage buffer (10 mM Tris-pH 8.1, 0.9% NaCI, and 10% 
glycerol), aliquoted and stored at - 70°C. Purified Ad5.fSgal.AF virus 
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particles containing human adenovirus Ad5.fcgal.AFgenome (described 
further below) have been deposited with the ATCC on January 15, 1999. 

For viral titering, Ad preparations were titered by plaque assay on 
21 1B cells. Cells were plated on polylysine-coated 6 well plates at 1 .5 x 
5 10 6 cells/well. Duplicate dilutions of virus stock were added to the plates 
in 1 ml/well of complete DMEM. After a five hour incubation at 37°C, 
virus was removed and the wells overlaid with 2 ml of 0.6% low-melting 
agarose in Medium 199 (Gibco). An additional 1 ml of overlay was added 
at five day intervals. 

10 As a control, the first-generation virus Ad5.R> gal.wt, which is 

identical to Ad5.Bgal.AF except for the fiber deletion, was constructed by 
cotransfection of pBHG10 and pAEIBfcgal. In contrast to the low effi- 
ciency of recovery of the fiberless genome (1/58 dishes), all of 9 dishes 
cotransfected with pAE!B/?gal and pBHG10 produced virus. 

15 In another embodiment, a delivery plasmid is prepared that does 

not require the above-described recombination events to prepare a viral 
vector having a fiber gene deletion. In one embodiment, a single delivery 
plasmid containing all the adenoviral genome necessary for packaging but 
lacking the fiber gene is prepared from plasmid pFG140 containing full- 

20 length Ad5 that is commercially available from Microbix. The resultant 
delivery plasmid referred to as pFG140-f is then used with pCLF (ATCC 
accession number 97737; and described in copending U.S. application 
Serial No. 09/482,682 (also filed as International PCT application No. 
PCT/US00/00265 on January 14, 2000)) stably integrated cells as 

25 described above to prepare a viral vector lacking fiber. For genetic 

therapy, the fiber gene can be replaced with a therapeutic gene of interest 
for preparing a therapeutic delivery adenoviral vector. 

Vectors for the delivery of any desired gene and preferably a 
therapeutic gene are prepared by cloning the gene of interest into the 
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multiple cloning sites in the polylinker of commercially available pAElsplB 
(Microbix Biosystems; see also, U.S. Patent No. 6,140,087), in an 
analogous manner as performed for preparing pE1B/? gal as described 
above. The same cotransfection and recombination procedure is then 
5 followed as described herein to obtain viral gene delivery vectors. 

1. Characterization of the Ad5.jffgal.AF Genome 
To confirm that the vector genomes had the proper structures and 
that the fiber gene was absent from the Ad5.ISgal.AF chromosome, the 
DNA isolated from viral particles was analyzed. Briefly, purified viral DNA 

10 was obtained by adding 10 //I of 10 mg/ml proteinase K, 40 //I of 0.5 M 
EDTA and 50 //I of 10% SDS to 800 //I of adenovirus-containing culture 
supernatant. The suspension was then incubated at 55 °C for 60 
minutes. The solution was then extracted once with 400 fj\ of a 24:1 
mixture of chIoroform:isoamyI alchohol. The aqueous phase was then 

15 removed and precipitated with sodium acetate/ethanol. The pellet was 
washed once with 70% ethanol and lightly dried. The pellet was then 
suspended in 40 //I of 10 mM Tris-HCI, pH 8.0, 1 mM EDTA. Genomic 
DNA from Ad5.Bgal.wt and Ad5.R>gal.AF produced the expected 
restriction patterns following digestion with either EcoRI or with Ndel. 

20 Southern blotting, performed with standard methods, with labeled fiber 
DNA as a probe demonstrated the presence of fiber sequence in 
Ad5.l5gal.wt but not in Ad5.fcgal.AF DNA. As a positive control, the blot 
was stripped and reprobed with labeled E4 sequence. Fiber and E4 
sequences were detected by using labeled inserts from pCLF and 

25 pE4/Hygro, respectively. E4 signal was readily detectable in both 
genomes at equal intensities. The complete nucleotide sequence of 
Ad5.fcgal.AF is presented in SEQ ID NO: 106 and is contained in the virus 
particle deposited with ATCC. 
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2. Characterization of the Fiberless Adenovirus 
Ad5.£gaI.AF 

To verify that Ad5.ISgaI.AF was fiber-defective, 293 cells (which 
are permissive for growth of E1 -deleted Ad vectors but do not express 
5 fiber) were infected with Ad5.fSgaI.AF or with Ad5.Bgal.wt. Twenty-four 
hours post infection, the cells were stained with polyclonal antibodies 
directed either against fiber or against the penton base protein. Cells 
infected with either virus were stained by the anti-penton base antibody, 
while only cells infected with the Ad5.ISgal.wt control virus reacted with 
10 the anti-fiber antibody. This confirms that the fiber-deleted Ad mutant 
does not direct the synthesis of fiber protein. 

3. Growth of the Fiber-Deleted Ad5./?gal.AF Vector in 
Complementing Cells 

Ad5.ISgal.AF was found to readily be propagated in 21 1B cells. As 

15 assayed by protein concentration, CsCI-purified stocks of either 

Ad5.fSgal.AF or Ad5.ISgal.wt contained similar numbers of viral particles. 
The particles appeared to band normally on CsCI gradients. Infectivity of 
the Ad5.fSgal.AF particles was lower than the Ad5.fSgal.wt control, as 
indicated by an increased particle/PFU ratio. Ad5.fSgal.AF was also 

20 found to plaque more slowly than the control virus. When plated on 

21 1B cells, Ad5.fSgal.wt plaques appeared within 5-7 days, while plaques 
of Ad5.ISgal.AF continued to appear until as much as 15-18 days post 
infection. Despite their slower formation, the morphology of Ad5.fSgal.AF 
plaques was essentially normal. 

25 4. Production of Fiberless Ad5.ISgal.AF Particles 

As Ad5.(Sgal.AF represents a true fiber null mutation and its stocks 
are free of helper virus, the fiber mutant phenotype was readily 
investigated. A single round of growth in cells (such as 293) which do 
not produce fiber generating a homogeneous preparation of fiberless Ad 

30 allowed for the determination of whether such particles would be stable 
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and/or infectious. Either Ad5.fSgal.wt or Ad5.Bgal.AF was grown in 293 
or 21 1B cells, and the resulting particles purified on CsCI gradients as 
previously described. Ad5.Bgal.AF particles were readily produced in 293 
cells at approximately the same level as the control virus and behaved 
5 similarly on the gradients, indicating that there was not a gross defect in 
morphogenesis of fiberless capsids. 

Particles of either virus contained similar amounts of penton base 
regardless of the cell type in which they were grown. This demonstrated 
that fiber is not required for assembly of the penton base complex into 

10 virions. The Ad5.Bgal.AF particles produced in 293 cells did not contain 
fiber protein. 21 IB-grown Ad5.Bgal.AF also contained less fiber than the 
Ad5.Bgal.wt control virus. The infectivities of the different viral 
preparations on epithelial cells correlated with the amount of fiber protein 
present. The fiberless Ad particles were several thousand-fold less 

15 infectious than the first-generation vector control on a per-particle basis, 

while infectivity of 21 IB-grown Ad5.Bgal.AF was only 50-100 fold less 

than that of Ad5.Bgal.wt. These studies confirmed fiber's crucial role in 

infection of epithelial cells via CAR binding. 

5. Composition and Structure of the Fiberless 
20 Ad5.BgaI.AF Particles 

The proteins contained in particles of 293-grown Ad5.Bgal.AF were 

compared to those in Ad5.Bgal.wt, to determine whether proteolysis or 

particle assembly was defective in this fiber null mutant. The overall 

pattern of proteins in the fiberless particles was observed to be quite 

25 similar to that of a first-generation vector, with the exception of reduced 

intensity of the composite band resulting from proteins Ilia and IV (fiber). 

The fiberless particles also had a reduced level of protein VII. Although 

substantial amounts of uncleaved precursors to proteins VI, VII, and VIII 

were not seen, it is possible that the low-molecular weight bands 
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migrating ahead of protein VII represent either aberrantly cleaved viral 
proteins or their breakdown products. 

Cryo-electron microscopy was used to more closely examine the 
structure of the 293 grown Ad5.fcgal.AF and of Ad5fcgal.wt. The fiber, 
5 having an extended stalk with a knob at the end, was faintly visible in 
favorable orientations of wild-type Ad5 particles, but not in images of the 
fiberless particles. Filamentous material likely corresponding to free viral 
DNA was seen in micrographs of fiberless particles. This material was 
also present in micrographs of the first-generation control virus, albeit at 
10 much lower levels. 

Three-dimensional image reconstructions of fiberless and wild-type 
particles at -20 A resolution showed similar sizes and overall features, 
with the exception that fiberless particles lacked density corresponding to 
the fiber protein. The densities corresponding to other capsid proteins, 

15 including penton base and proteins Ilia, VI, and IX, were comparable in 
the two structures. This confirms that absence of fiber does not prevent 
assembly of these components into virions. The fiber was truncated in 
the wild-type structure as only the lower portion of its flexible shaft 
follows icosahedral symmetry. The RGD protrusions on the fiberless 

20 penton base were angled slightly inward relative to those of the wild-type 
structure. Another difference between the two penton base proteins was 
that there is a -30 A diameter depression in the fiberless penton base 
around the five-fold axis where the fiber would normally sit. The Ad5 
reconstructions confirm that capsid assembly, including addition of 

25 penton base to the vertices, is able to proceed in the complete absence of 
fiber. 
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6. Integrin-Dependent Infectivity of Fiberless Ad5.KgaI.AF 
Particles 

While attachment via the viral fiber protein is a critical step in the 
infection of epithelial cells, an alternative pathway for infection of certain 
5 hematopoietic cells has been described. In this case, penton base 

mediates binding to the cells (via B2 integrins) and internalization (through 
interaction with av integrins). Particles lacking fiber might therefore be 
expected to be competent for infection of these cells, even though on a 
per-particle basis they are several thousand-fold less infectious than 

10 normal Ad vectors on epithelial cells. 

To investigate this, THP-1 monocytic cells were infected with 
Ad5.fcgal.wt or with Ad5.ISgal.AF grown in the absence of fiber. 
Infection of THP-1 cells was assayed by infecting 2 x 10 5 cells at the 
indicated m.o.i. in 0.5 ml of complete RPMI. Forty-eight hours 

15 post-infection, the cells were fixed with glutaraldehyde and stained with 
X-gal, and the percentage of stained cells was determined by light 
microscopy. The results of the infection assay showed that the fiberless 
particles were only a few-fold less infectious than first-generation Ad on 
THP-1 cells. Large differences were seen in plaquing efficiency on 

20 epithelial (21 1 B) cells. Infection of THP-1 cells by either Ad5.fegal.AF or 
Ad5.(Sgal.wt was not blocked by an excess of soluble recombinant fiber 
protein, but could be inhibited by the addition of recombinant penton 
base). These results indicate that the fiberless Ad particles use a 
fiber-independent pathway to infect these cells. Furthermore, the lack of 

25 fiber protein did not prevent Ad5.fcgalAF from internalizing into the cells 
and delivering its genome to the nucleus, demonstrating that fiberless 
particles are properly assembled and are capable of uncoating. 

The foregoing results with the recombinant viruses thus produced 
indicates that they can be used as gene delivery tools in cultured cells 
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and in vivo. For example, for studies of the effectiveness and relative 
immunogenicity of multiply-deleted vectors, virus particles are produced 
by growth in packaging lines and are purified by CsCI gradient 
centrifugation. Following titering, virus particles are administered to mice 
5 via systemic or local injection or by aerosol delivery to lung. The LacZ 
reporter gene allows the number and type of cells which are successfully 
transduced to be evaluated. The duration of transgene expression is 
evaluated in order to determine the long-term effectiveness of treatment 
with multiply-deleted recombinant adenoviruses relative to the standard 

10 technologies which have been used in clinical trials to date. The immune 
response to the improved vectors described here is determined by 
assessing parameters such as inflammation, production of cytotoxic T 
lymphocytes directed against the vector, and the nature and magnitude of 
the antibody response directed against viral proteins. 

15 Versions of the vectors which contain therapeutic genes such as 

CFTR for treatment of cystic fibrosis or tumor suppressor genes for 

cancer treatment are evaluated in the animal system for safety and 

efficiency of gene transfer and expression. Following this evaluation, 

they are used as experimental therapeutic agents in human clinical trials. 

20 B. Retargeting of Adenoviral Gene Delivery Vectors by 

Producing Viral Particles Containing Different or Altered Fiber 
Proteins 

As the specificity of adenovirus binding to target cells is largely 
determined by the fiber protein, viral particles that incorporate modified 
25 fiber proteins or fiber proteins from different adenoviral serotypes 

(pseudotyped vectors) have different specificities. Thus, the methods of 
expression of the native Ad5 fiber protein in adenovirus packaging cells 
as described above also is applicable to production of different fiber 
proteins. 
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Chimeric fiber proteins can be produced according to known 
methods (see, e.g., Stevenson et al. (1995) J. Virol., 65:2850-2857). 
Determinants for fiber receptor binding activity are located in the head 
domain of the fiber and an isolated head domain is capable of 
5 trimerization and binding to cellular receptors. The head domains of 
adenovirus type 3 (Ad3) and Ad5 were exchanged in order to produce 
chimeric fiber proteins. Similar constructs for encoding chimeric fiber 
proteins for use in the methods herein are contemplated. Thus, instead of 
using the intact Ad5 fiber-encoding construct (prepared above and in U.S. 

10 application Serial No. 09/482,682) as a complementing viral vector in 
adenoviral packaging cells, the constructs described herein are used to 
transfect cells along with E4 and/or El -encoding constructs. 

Briefly, full-length Ad5 and Ad3 fiber genes were amplified from 
purified adenovirus genomic DNA as a template. The Ad5 and Ad3 

15 nucleotide sequences are available with the respective GenBank 

Accession Numbers M18369 and M12411. Oligonucleotide primers are 
designed to amplify the entire coding sequence of the full-length fiber 
genes, starting from the start codon, ATG, and ending with the 
termination codon TAA. For cloning purposes, the 5' and 3' primers 

20 contain the respective restriction sites BamHI and Notl for cloning into 
pcDNA plasmid. PCR is performed as described above. 

The resulting products are then used to construct chimeric fiber 
constructs by PCR gene overlap extension (Horton et al. (1990) 
BioTechniques, 5:525-535). The Ad5 fiber tail and shaft regions (5TS; 

25 the nucleotide region encoding amino acid residue positions 1 to 403) are 
connected to the Ad3 fiber head region (3H; the nucleotide region 
encoding amino acid residue positions 136 to 319) to form the 5TS3H 
fiber chimera. Conversely, the Ad3 fiber tail and shaft regions (3TS; the 
nucleotide region encoding amino acid residues positions 1 to 135) are 
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connected to the Ad5 fiber head region (5H; the nucleotide region 
encoding the amino acid residue positions 404 to 581) to form the 3TS5H 
fiber chimera. The fusions are made at the conserved TLWT (SEQ ID NO: 
46) sequence at the fiber shaft-head junction. 
5 The resultant chimeric fiber PCR products are then digested with 

BamHI and Notl for separate directional ligation into a similarly digested 
pcDNA3.1 . The TPL sequence is then subcloned into the BamHI for 
preparing an expression vector for subsequent transfection into 21 1 cells 
or into alternative packaging cell systems. The resultant chimeric fiber 

10 construct-containing adenoviral packaging cell lines are then used to 

complement adenoviral delivery vectors as previously described. Other 
fiber chimeric constructs are obtained with the various adenovirus 
serotypes using a similar approach. 

In an alternative embodiment, the use of modified proteins 

15 including with modified epitopes (see, e.g., Michael eta/. (1995) Gene 
Therapy, 2:660-668 and International PCT application Publication No. 
WO 95/26412, which describe the construction of a cell-type specific 
therapeutic viral vector having a new binding specificity incorporated into 
the virus concurrent with the destruction of the endogenous viral binding 

20 specificity). In particular, the authors described the production of an 

adenoviral vector encoding a gastrin releasing peptide (GRP) at the 3' end 
of the coding sequence of the Ad5 fiber gene. The resulting fiber-GRP 
fusion protein was expressed and shown to assemble functional fiber 
trimers that were correctly transported to the nucleus of HeLa cells 

25 following synthesis. 

Similar constructs are contemplated for use in the complementing 
adenoviral packaging cell systems for generating new adenoviral gene 
delivery vectors that are targetable, replication-deficient and less 
immunogenic. Heterologous ligands contemplated for use herein to 
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redirect fiber specificity range from as few as 10 amino acids in size to 
large globular structures, some of which necessitate the addition of a 
spacer region so as to reduce or preclude steric hindrance of the 
heterologous ligand with the fiber or prevent trimerization of the fiber 
5 protein. The ligands are inserted at the end or within the linker region. 
Preferred ligands include those that target specific cell receptors or those 
that are used for coupling to other moieties such as biotin and avidin. 

A preferred spacer includes a short 12 amino acid peptide linker 
composed of a series of serines and alanine flanked by a proline residue 

10 at each end using routine procedures known to those of skill in the art. 
The skilled artisan will be with the preparation of linkers to accomplish 
sufficient protein presentation and to alter the binding specificity of the 
fiber protein without compromising the cellular events that follow viral 
internalization. Moreover, within the context of this disclosure, 

15 preparation of modified fibers having ligands positioned internally within 
the fiber protein and at the carboxy terminus as described below are 
contemplated for use with the methods described herein. 

The preparation of a fiber having a heterologous binding ligand is 
prepared essentially as described in the above-cited paper. Briefly, for the 

20 ligand of choice, site-directed mutagenesis is used to insert the coding 
sequence for a linker into the 3' end of the Ad5 fiber construct in pCLF. 

The 3' or antisense or mutagenic oligonucleotide encodes a 
preferred linker sequence of ProSerAlaSerAlaSerAIaSerAIaProGlySer (SEQ 
ID NO: 107) followed by a unique restriction site and two stop codons, 

25 respectively, to allow the insertion of a coding sequence for a selected 

heterologous ligand and to ensure proper translation termination. Flanking 
this linker sequence, the mutagenic oligonucloetide contains sequences 
that overlap with the vector sequence and allow its incorporation into the 
construct. Following mutagenesis of the pCLF sequence adding the 
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linker and stop codon sequences, a nucleotide sequence encoding a 
preselected ligand is obtained, linkers corresponding to the unique 
restriction site in the modified construct are attached and then the 
sequence is cloned into linearized corresponding restriction site. 
5 The resultant fiber-ligand construct is then used to transfect 21 1 or the 
alternative cell packaging systems previously described to produce 
complementing viral vector packaging systems. 

In a further embodiment, intact fiber genes from different Ad 
serotypes are expressed by 21 1 cells or an alternative packaging system. 

10 A gene encoding the fiber protein of interest is first cloned to create a 
plasmid analogous to pCLF, and stable cell lines producing the fiber 
protein are generated as described above for Ad5 fiber. The adenovirus 
vector described which lacks the fiber gene is then propagated in the cell 
line producing the fiber protein relevant for the purpose at hand. As the 

15 only fiber gene present is the one in the packaging cells, the adenoviruses 
produced contain only the fiber protein of interest and therefore have the 
binding specificity conferred by the complementing protein. Such viral 
particles are used in studies such as those described above to determine 
their properties in experimental animal systems. 

20 EXAMPLE 22 

Preparation and Use of Adenoviral Packaging Cell Lines 
Containing Plasmids Containing Alternative TPLs 

Plasmids containing tripartite leaders (TPLs) have been constructed. 

The resulting plasmids that contain different selectable markers, such as 

25 neomycin and zeocin, were then used to prepare fiber-complementing 

stable cell lines for use as for preparing adenoviral vectors. 

A. pDV60 

Plasmid pDV60 was constructed by inserting the TPL cassette of 
SEQ ID NO. 88 into the BamHI site upstream of the Ad5 fiber gene in 
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pcDNA3/Fiber, a neomycin selectable plasmid (see, e.g., U.S. application 
Serial No. 09/482,682 (also filed as International PCT application No. 
PCT/US00/00265 on January 14, 2000); see also Von Seggern et al. 
(1998) J. Gen Virol., 79: 1461-1468). The nucleotide sequence of 
5 pDV60 is listed in SEQ ID NO: 108. Plasmid pDV60 is available from the 
ATCC under accession number PTA-1 144. 

B. pDV61 

To construct pDV61, an Asp718/Notl fragment containing the 
CMV promoter, partial Ad5 TPL, wildtype Ad5 fiber gene, and bovine 

10 growth hormone terminator was transferred from pCLF (ATCC accession 
number 97737; and described in copending U.S. application Serial 
No. 09/482,682 (also filed as International PCT application No. 
PCT/US00/00265 on January 14, 2000)), to a zeocin selectable cloning 
vector referred to as pCDNA3.1/Zeo ( + ) (commerically available from 

15 Invitrogen and for which the sequence is known). 

C. pDV67 

In an analogous process, pDV67 containing complete TPL was 
constructed by transferring an Asp 718/Xbal fragment from pDV60 into 
pcDNA3.1/Zeo( + ) backbone. The nucleotide sequence of pDV67 is set 
20 forth in SEQ ID NO. 109. Plasmid pDV67 is available from the ATCC 
under accession number PTA-1 145. 

D. pDV69 

To prepare pDV69 containing a modified fiber protein, the chimeric 
Ad3/Ad5 fiber gene was amplified from pGEM5TS3H (Stevenson et al. 
25 (1995) J. Virol., 69: 2850-2857) using the primers 5'ATGGGAT 
CAAGATGAAGCGCGCAAGACCG3' (SEQ ID NO. 110) and 
5'CACTATAGCGGCCGCATTCTCAGTCATCTT3' (SEQ ID NO. 111), and 
cloned to the BamHI and Notl sites of pcDNA3.1/Zeo( + ) via new BamHI 
and Notl sites engineered into the primers to create pDV68. Finally, the 
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complete TPL fragment described above was then added to the unique 
BamHI site of pDV68 to create pDV69. The nucleotide sequence of 
pDV69 is listed in SEQ ID NO. 1 12 and the plasmid is available from the 
ATCC under accession number PTA-1 146. 
5 E. Preparation of Stable Adenovirus Packaging Cell Lines 

E1-2a S8 cells are derivatives of the A549 lung carcinoma line 
(ATCC # CCL 185) with chromosomal insertions of the plasmids 
pGRE5-2.E1 (also referred to as GRE5-E1 -SV40-Hygro construct and 
listed in SEQ ID NO. 47) and pMNeoE2a-3. 1 (also referred to as MMTV- 

10 E2a-SV40-Neo construct and listed in SEQ ID NO. 48), which provide 
complementation of the adenoviral E1 and E2a functions, respectively. 
This line and its derivatives were grown in Richter's modified medium 
(BioWhitaker) + 10% FCS. E1-2a S8 cells were electroporated as 
previously described (Von Seggern et al. (1998) J. Gen Virol. , 79: 1461- 

15 1468) with pDV61, pDV67, or with pDV69, and stable lines were 
selected with zeocin (600 jt/g/ml). 

The cell line generated with pDV61 is designated 601 . The cell line 
generated with pDV67 is designated 633 while that generated with 
pDV69 is designated 644. Candidate clones were evaluated by 

20 immunofluorescent staining with a polyclonal antibody raised against the 
Ad2 fiber. Lines expressing the highest level of fiber protein were further 
characterized. 

For the S8 cell complementing cell lines, to induce E1 expression, 
0.3 //M of dexamethasone was added to cell cultures 16-24 hours prior to 
25 challenge with virus for optimal growth kinetics. For preparing viral 

plaques, 5 X 10 5 cells/well in 6 well plates are prepared and pre-induced 
with the same concentration of dexamethasone the day prior to infection 
with 0.5 juM included at a final concentration in the agar overlay after 
infection. 
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F. Cell Lines for Complementation of E17E2a Vectors 

The Adenovirus 5 genome was digested with Seal enzyme, 
separated on an agarose gel, and the 6,095 bp fragment containing the 
left end of the virus genome was isolated. The complete Adenovirus 5 
5 genome is registered as Genbank accession #M73260 (or see SEQ ID NO. 
1), incorporated herein by reference, and the virus is available from the 
American Type Culture Collection, Manassas, Virginia, U.S.A., under 
accession number VR-5. The Seal 6,095 bp fragment was digested 
further with Clal at bp 917 and Bglll at bp 3,328. The resulting 2,41 1 bp 

10 Clal to Bglll fragment was purified from an agarose gel and ligated into 
the superlinker shuttle plasmid pSE280 (Invitrogen, San Diego, CA), 
which was digested with Clal and Bglll, to form pSE280-E. 

Polymerase chain reaction (PCR) was performed to synthesize DNA 
encoding an Xhol and Sail restriction site contiguous with Adenovirus 5 

15 DNA bp 552 through 924. The primers which were employed were as 
follows: 

5' end, Ad5 bp 552-585: 

5'-GTCACTCGAGGACTCGGTC-GACTGAAAATGAGACATATTATCTGCC 
ACGGACC-3' (SEQ ID NO. 113) 
20 3' end, Ad5 bp 922-891: 

5'-CGAGATCGATCACCTCCGGTACAAGGTTTGGCATAG-3' (SEQ ID NO. 
114) 

This amplified DNA fragment (also referred to herein as Fragment 
A) was digested with Xhol and Clal, which cleaves at the native Clal site 
25 (bp 917), and ligated to the Xhol and Clal sites of pSE280-E, thus 

reconstituting the 5' end of the E1 region beginning 8 bp upstream of the 
ATG codon. PCR amplification then was performed to amplify Ad 5 DNA 
from bp 3,323 through 4,090 contiguous with an EcoRI restriction site. 
The primers employed were as follows: 
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5' end, Ad5 bp 3323-3360: 

5'-CATGAAGATCTGGAAGGTGCTGAGGTACGATGAGACC-3' (SEQ ID 
NO. 115); and 
3' end, Ad5 bp 4090-4060: 
5 B'-GCGACTTAAGCAGTCAGCTG-AGACAGCAAGACACTTGCTTGATCCA 
AATCC-3' (SEQ ID NO. 116). 

This amplified DNA fragment (also referred to herein as Fragment 
B) was digested with Bglll, thereby cutting at the Adenovirus 5 Bglll site 
(bp 3,382) and EcoRI, and ligated to the Bglll and EcoRI sites of 
10 pSE280-AE to reconstruct the complete E1a and E1b region from 

Adenovirus 5 bp 552 through 4,090. The resulting plasmid is designated 
pSE280-E1. 

A construct containing the intact E1a/b region under the control of 
the synthetic promoter GRE5 was prepared as follows. The intact E1a/b 

15 region was excised from pSE280-E1, which was modified previously to 
contain a BamHI site 3' to the E1 gene, by digesting with Xhol and 
BamHL The Xhol to BamHI fragment containing the E1a/b fragment was 
cloned into the unique Xhol and BamHI sites of pGRE5-2/EBV (U.S. 
Biochemicals, Cleveland, Ohio) to form pGRE5-E1). 

20 Bacterial transformants containing the final construct were 

identified. Plasmid DNA was prepared and purified by banding in CsTFA 

prior to use for transfection of cells. 

G. Construction of plasmid including Adenovirus 5 E2A 
sequence 

25 The Adenovirus 5 genome was digested with BamHI and Spel, 

which cut at bp 21,562 and 27,080, respectively. Fragments were 
separated on an agarose gel and the 5,518 bp BamHI to Spel fragment 
was isolated. The 5,518 bp BamHI to Spel fragment was digested further 
with Smal, which cuts at bp 23,912. The resulting 2,350 bp BamHI to 
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Smal fragment was purified from an agarose gel, and ligated into the 
superlinker shuttle plasmid pSE280, and digested with BamHI and Smal to 
form pSE280-E2 BamHI-Smal. 

PGR then was performed to amplify Adenovirus 5 DNA from the 
5 Smal site at bp 23,912 through 24,730 contiguous with Nhel and EcoRI 
restriction sites. The primers which were employed were as follows: 
5' end, Ad5 bp 24,732-24,708: 

5'-CACGAATTCGTCAGCGCTTCTCGTCGCGTCCAAGACCC-3' (SEQ ID 

NO. 117); 
10 3' end, Ad5 bp 23,912-23,934: 

5 , -CACCCCGGGGAGGCGGCGGCGACGGGGACGGG-3 , (SEQ ID NO. 118) 
This amplified DNA fragment was digested with Smal and EcoRI, 

and ligated to the Smal and EcoRI sites of pSE280-E2 Bam-Sma to 

reconstruct the complete E2a region from Ad5 bp 24,730 through 
15 21,562. The resulting construct is pSE280-E2a. 

In order to convert the BamHI site at the 3' end of E2a to a Sail 

site, the E2a region was excised from pSE280-E2a by cutting with BamHI 

and Nhel, and recloned into the unique BamHI and Nhel sites of pSE280. 

Subsequently, the E2a region was excised from this construction with 
20 Nhel and Sail in order to clone into the Nhel and Sail sites of the 

pMAMneo (Clonetech, Palo Alto, CA) multiple cloning site in a 5' to 3' 

orientation, respectively. The resulting construct is pMAMneo E2a. 

Bacterial transformants containing the final pMAMneo-E2a were 

identified. Plasmid DNA was prepared and purified by banding in CsTFA. 
25 Circular plasmid DNA was linearized at the Xmnl site within the ampicillin 

resistance gene of pMAMneo-E2a, and further purified by the 

phenol/chloroform extraction and ethanol precipitation prior to use for 

transfection of cells. 
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H. Transfection and selection of cells 

In general, this process involved the sequential introduction, by 
calcium phosphate precipitation, or other means of DNA delivery, of two 
plasmid constructions each with a different viral gene, into a single tissue 
5 culture cell. The cells were transfected with a first construct and selected 
for expression of the associated drug resistance gene to establish stable 
integrants. Individual cell clones were established and assayed for 
function of the introduced viral gene. Appropriate candidate clones then 
were transfected with a second construct including a second viral gene 
10 and a second selectable marker. Transfected cells then were selected to 
establish stable integrants of the second construct, and cell clones were 
established. Cell clones were assayed for functional expression of both 
viral genes. 

A549 (ATCC Accession No. CCL-185) were used for transfection. 

15 Appropriate selection conditions were established for G418 and 

hygromycin B by standard kill curve determination. 

Transfection of A549 cells with plasmids including E1 and 
E2a regions. 

pMAMNeo-E2a was linearized with Xmnl with the Amp R gene, 
20 introduced into cells by transfection, and cells were selected for stable 
integration of this plasmid by G418 selection until drug resistant colonies 
arose. The clones were isolated and screened for E2a expression by 
staining for E2a protein with a polyclonal antiserum, and visualizing by 
immunofluorescence. E2a function was screened by complementation of 
25 the temperature-sensitive mutant Ad5ts125 virus which contains a 

temperature-sensitive mutation in the E2a gene. (Van Der Vliet, et al., J. 
Virology, Vol. 15, pgs. 348-354 (1975)}. Positive clones expressing the 
E2a gene were Identified and used for transfection with the 7 kb EcoRV 
to Xmnl fragment from pGRE5-El, which contains the GRE5 promoted 
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E1a/b region plus the hygromycin R gene. Cells were selected for 

hygromycin resistance and assayed for E1a/b expression by staining with 

a monoclonal antibody for the E1 protein (Oncogene Sciences, Uniondale, 

N.Y.). E1 function was assayed by ability to complement an E1 -deleted 

5 vector. At this point, expression and function of E2a was verified as 

described above, thus establishing the expression of E1a/b and E2a in the 

positive cell clones. 

A transfected A549 (A549 (ATCC Accession No. CCL-1 85);) cell 

line showed good E1a/b and E2a expression and was selected for further 

10 characterization. It was designated the S8 cell line. 

I. Preparation of Adenoviral Vectors Containing Ad5.£gal.AF 
Genome in S8 Fiber-Complementing Cell Lines 

To prepare adenoviral vectors containing Ad5./?gal.AF (Ad5.£gaI.AF 

has been was deposited the ATCC under accession number VR2636) in 

15 S8 cells containing alternative forms of TPL for enhancing the expression 
of fiber proteins, the protocol as described in Example 21 for preparing 
Ad5./?gal.AF in 211 B cells was followed with the exception of pretreat- 
ment with 0.3 //M dexamethasone for 24 hours as described above. 
Thus, viral particles with the wildtype Ad5 fiber protein on their surface 

20 and containing the fiberless Ad5.yffgal.AF genome were produced in 633 
cells. Particles produced in 644 cells also contained the fiberless 
Ad5.£gal.AF genome, but had the chimeric 5T3H fiber protein, with the 
Ad3 fiber knob, on their surface. These viral preparations can be used to 
target delivery of the Ad5.£gaI.AF, Ad5.GFP.AF, or other similarly 

25 constructed fiberless genome with either wild-type or modified fibers. 
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EXAMPLE 23 

Enhanced infectivity of dendritic cells by pseudotyped adenoviral particles 

Bone marrow-derived dendritic cells were generated by culture of 
bone marrow cells from female Balb/C mice with GM-CSF and IL-4 (Inaba 
5 et al. (1998) Isolation of dendritic cells in Current Protocols in 

Immunology, John Wiley & Sons, Inc. Philadelphia, 3.7.1-3.7.15). To 
confirm that the cultured cells expressed surface markers characteristic of 
dendritic cells, the cells were stained with fluorescently-conjugated 
antibodies directed against CD1lc, CD80, and CD86 and analyzed by 

10 fluorescence-activated cell sorting (FACS) analysis. Antibodies against 
the dendritic cell markers CD1 1c, CD80 and CD86 are commercially 
available, such as from eBioscience. 

The primary dendritic cell cultures were infected with 100,000 viral 
particles/cell of Ad5.GFP.AF pseudotyped with either Ad5, Ad16, Ad19p, 

15 Ad30, Ad35 or Ad37 fiber. The percent of cells positive for virus-induced 
GFP expression was determined by FACS analysis 48 hours after 
infection. All infections were performed in triplicate, and the mean ± 
standard deviation was determined. 

In agreement with previous experiments, Ad5.GFP.AF pseudotyped 

20 with Ad5 fiber infected dendritic cells poorly with approximately 10% of 
cells positive for GFP expression, which is likely due to the lack of CAR 
expression on dendritic cells. In contrast, viruses carrying the Ad 16, 
Ad19p, Ad30, Ad35 or Ad37 fiber proteins demonstrated enhanced 
infectivity of dendritic cells (approximately 49%, 46%, 37%, 26% and 

25 50% of cells were GFP-positive), indicating that the fiber receptors for 
these serotypes are expressed on dendritic cells. 
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EX AMPLE 24 

Subgroup D adenoviruses demonstrate selective infectivity 

Sequence and phylogenetic analysis of adenovirus fiber DNA and 
amino acid sequence suggests that subgroup B and subgroup D viruses 
5 bind different cellular receptors (Havenga et al. (2002) J. Virol. 76:4612- 
4620). In addition, while subgroup B viruses (such as Ad16, Ad35 and 
Ad50), are capable of infecting a wide variety of cancer cell lines and 
primary cells, including endothelial cells, smooth muscle cells, 
synoviocytes, fibroblasts, amniocytes, dendritic cells, bone marrow 

10 stroma cells, chondrocytes, myoblasts, melanocytes, follicle dermal 

papilla cells and hematopoietic stem cells (Havenga et al. (2002) J. Virol. 
76:461 2-4620), subgroup D viruses have a more selective tropism. 

To determine whether select subgroup B (Ad3, Ad 16 and Ad35) 
and subgroup D (Ad19p, Ad30 and Ad37) adenoviruses exhibit the same 

15 cellular tropism, a panel of cancer cell lines were tested for their capacity 
to support Ad gene delivery. The cell lines used were PC-3 cells, HepG2 
cells, LNCaP cells and DU 145 cells. These cell lines are available from 
the ATCC under accession numbers CRL-1435, HB-8065, CRL-10995 and 
HTB-81, respectively. 

20 Each cell line was infected with either 1000, 5000 or 10,000 

particles per cell of Ad5.GFP.AF pseudotyped with Ad5 (subgroup C), 
Ad3, Ad16, Ad19p, Ad30, Ad35 or Ad37 fiber. After 48 hours, virus- 
directed GFP expression was determined by FACS analysis. For PC-3 
cells infected with 1000 particles per cell, little to no GFP expression was 

25 detected in cells infected with viruses pseudotyped with subgroup D 

fibers Ad19p, Ad30 and Ad37. In contrast, GFP-expression was detected 
in approximately 40% of PC-3 cells infected with Ad 16 and Ad35 fiber 
containing viruses. A similar pattern of GFP expression was found with 
cells infected at higher multiplicities of infection (MOIs). Approximately 
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80% of PC-3 cells infected with 5000 particles per cell of adenoviruses 
pseudotyped with Ad 16 or Ad35 fiber were GFP positive, whereas only 
2% of PC-3 cells were GFP positive when infected with Ad19p or Ad30 
fiber pseudotyped viruses. 
5 Similarly, in HepG2 cells, approximately 80% of cells were GFP- 

positive when infected with 5000 particles per cell of Ad 16 or Ad35 fiber 
pseudotyped viruses, but less than 25% were GFP-positive when infected 
with either Ad19p or Ad30 fiber pseudotyped viruses. In addition, less 
than 10% of LNCaP cells were GFP-positive when infected with either 

10 5000 or 10,000 particles per cell of Ad19p, Ad30 or Ad37 fiber 

containing adenoviruses, whereas Ad16 and Ad35 fiber directed GFP 
expression in approximately 65% of LNCaP cells. A similar pattern of 
infection was found in DU 145 cells. These results further demonstrate 
that subgroup B adenoviruses have a wider cellular tropism than subgroup 

15 D viruses and provides additional evidence that subgroup B and subgroup 
D adenoviruses use different receptors for cell binding and infection. 

EXAMPLE 25 

Immunization with adenovirus particles pseudotyped with Ad37 fiber 
20 results in T-cell stimulation 

The following experiment was performed to determine whether 

immunization of mice with adenoviral particles pseudotyped with fiber 

protein from subgroup D adenovirus leads to stimulation of CD8+ T cells. 

Mice (eight in each experimental group, four in the vehicle (control) 

25 group) were immunized by subcutaneous injection with 1 x 10 10 particles 

of either Ad5.GFP.WT (Ad5 particles pseudotyped with Ad5 fiber) or 

Ad5.GFP.F37 (Ad5 particles pseudotyped with Ad37 fiber). Four weeks 

following innoculation, spleens were harvested to quantitate stimulation 

of T cells by determining the number of IFN-jz-positive CD8+ T cells. 



WO 2004/099422 



PCT/US2004/009219 



-166- 

To determine the percentage of activated CD8 + T cells in 
immunized mice, spleens were isolated and mechanically disrupted. 
Following lysis of red blood cells, 1 x 10 6 splenocytes were cultured for 
three hours in RPMI with 10% fetal calf serum and Golgiplug (BD 
5 Biosciences), in the presence or absence of 0.1 ywg/ml EGFP epitope 
peptide HYLSTQSAL or the irrelevant OVA peptide (SIINFEKL) as a 
control. Cells were then stained with an APC-conjugated anti-CD8 
antibody (eBioscience), fixed and permeabilized using the 
Cytofix/Cytoperm kit (BD Biosciences) and stained with a PE-conjugated 

10 antibody against IFN-K- The cells were analyzed by fluorescence 

activated cell sorting (FACS) and the percentage of CD8+ cells positive 
for IFN-k was determined ((number of CD8+ IFN-K + cells divided by the 
total number of CD8+ T cells) x 100). 

Immunization with adenovirus particles pseudotyped with either 

15 Ad5 fiber or Ad37 fiber led to stimulation of CD8+ T cells, as indicated 
by production of IFN-k «n these cells. These results indicate adenovirus 
particles with Ad37 fiber are excellent vaccine candidates due to their 
ability to stimulate CD8+ T cells while avoiding transduction of liver cells. 

20 Since modifications will be apparent to those of skill in this art, it is 

intended that this invention be limited only by the scope of the appended 
claims. 
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WHAT IS CLAIMED IS: 

1 . An adenovirus particle, comprising a heterologous fiber or a 
portion thereof, whereby binding of the viral particle to dendritic cells is 
increased compared to a particle that expresses its native fiber, wherein: 

the adenovirus (Ad) particle, except for the fiber, is from a 
subgroup C adenovirus; and 

the fiber includes fiber from a subgroup D adenovirus for 
binding to dendritic cells, wherein the subgroup D adenovirus is selected 
from the group consisting of adenovirus serotype 8, 9, 10, 13, 15, 17, 
19a, 19p, 20, 22-30, 32, 33, 36, 38, 39 and 42-49. 

2. An adenovirus particle, comprising a heterologous fiber or a 
portion thereof, whereby binding of the viral particle to heparin sulfate 
proteoglycans (HSP) is reduced or eliminated compared to a particle that 
expresses its native fiber, wherein: 

the adenovirus (Ad) particle, except for the fiber, is from a 
subgroup C adenovirus; and 

the fiber comprises fiber from Ad19p or Ad30, whereby HSP 
interaction is reduced. 

3. An adenovirus particle, comprising a heterologous fiber or a 
portion thereof, whereby binding of the viral particle to dendritic cells is 
increased compared to a particle that expresses its native fiber, wherein: 

the adenovirus (Ad) particle, except for the fiber, is from a 
subgroup C adenovirus; and 

the fiber comprises fiber from a subgroup B adenovirus for 
binding the virus to dendritic cells, wherein the subgroup B adenovirus is 
selected from the group consisting of adenovirus serotype 7, 11, 14, 21, 
34 or 50. 

4. A particle of any of claims 1-3, wherein: 
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the fiber is chimeric and comprises an N-terminal portion 
from a fiber of a subgroup C adenovirus; and 

the N-terminal portion is sufficient to increase incorporation 
into the particle compared to in its absence. 

5. The particle of claim 1 or 2, wherein the fiber is a chimeric 
fiber that includes a sufficient portion of a subgroup D adenovirus fiber to 
target dendritic cells. 

6. The particle of any of claims 1-5, wherein the subgroup C 
virus is selected from the group consisting of adenovirus serotype 1, 2, 5, 
and 6. 

7. The particle of any of claims 1-6, wherein the fiber is further 
modified to reduce any interaction with CAR. 

8. The particle of any of claims 1 and 3-7, wherein the fiber is 
modified to reduce any interaction with heparin sulfate proteoglycans 
(HSP). 

9. The particle of any of claims 1-8, wherein the capsid 
includes further modifications that alter interaction with a v integrin. 

10. The particle of any of claims 1 or 4-9, wherein the 
adenovirus (Ad) particle, except for the fiber, is from a subgroup C 
adenovirus; and 

the fiber is from Ad19p. 

11. The particle of claim 2 or claim 10, wherein the Ad19p fiber 
comprises at least a sufficient number of amino acids set forth as SEQ ID 
NO. 34 to target the particle to dendritic cells. 

1 2. The particle of claim 1 1 , wherein the Ad1 9p fiber comprises 
at least a sufficient number of amino acids set forth as SEQ ID NO. 34 to 
target the particle to dendritic cells, but exhibits reduced binding to HSP 
compared to a subgroup C fiber. 
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13. The particle of any of claims 10-12, wherein the fiber is 
chimeric and includes a portion of a subgroup C adenovirus. 

14. The particle of any of claims 1 or 4-9, wherein the 
adenovirus (Ad) particle, except for the fiber, is from a subgroup C 
adenovirus; and 

the fiber is from Ad30. 

15. The particle of claim 14, wherein the Ad30 fiber comprises 
at least a sufficient number of amino acids set forth as SEQ ID NO. 36 to 
target the particle to dendritic cells. 

16. The particle of claim 14, wherein the Ad30 fiber comprises 
at least a sufficient number of amino acids set forth as SEQ ID NO. 36 to 
target the particle to dendritic cells, but exhibits reduced binding to HSP 
compared to a subgroup C fiber. 

1 7. The particle of any of claims 1 4-1 6, wherein the fiber is 
chimeric and includes a portion of a subgroup C adenovirus. 

18. An adenovirus particle of any of claims 8-17, comprising a 
mutation in a CAR-binding region of the capsid to decrease CAR binding. 

19. An adenovirus particle of any of claims 1-18, comprising a 
mutation in the a y integrin-binding region of the capsid, whereby binding 
to the integrin is eliminated or reduced. 

20. The adenovirus particle of claim 1, 2, 4-13, 18 or 19, 
wherein the Ad19p fiber is modified by replacing the N-terminal 15, 16 or 
17 amino acids with the 15, 16 or 17 amino acids of an Ad2 or Ad5 
fiber. 

21. The adenovirus particle of claim 14-17, wherein the Ad30 
fiber is modified by replacing the N-terminal 15, 16 or 17 amino acids 
with the 15, 16 or 17 amino acids of an Ad2 or Ad5 fiber. 

22. The adenovirus particle of claim 7 or claim 18, wherein the 
CAR-binding region of the capsid that is modified is on a fiber knob. 
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23. The adenovirus particle of claim 22, wherein the fiber protein 
further comprises one or more further modifications that reduce or 
eliminate interaction of the resulting fiber with HSP. 

24. The adenovirus particle of claim 23, wherein the capsid 
further comprises a ligand, whereby the particle binds to a receptor for 
the ligand. 

25. The adenovirus particle of claim 24, wherein the ligand is 
included in the knob region of the fiber. 

26. The adenovirus particle of claim 24, wherein the ligand is 
inserted into the fiber or it replaces a portion of the fiber. 

27. A particle of any of claims 1-26, further comprising a 
heterologous nucleic acid in the genome thereof, wherein the 
heterologous nucleic acid encodes an antigen or a product that alters 
dendritic cell activity. 

28. The particle of claim 27, wherein the antigen is a tumor 
antigen or an antigen from a pathogen. 

29. A composition formulated for administration to a subject 
comprising a particle of any of claims 1-28. 

30. A composition of claim 29 formulated for intramuscular or IV 
or parenteral administration. 

31 . A composition of claim 29 or claim 30 that is a vaccine. 

32. An immunotherapeutic method, comprising administering a 
composition of any of claims 29-31 to a subject. 

33. A method of delivering viral particles to dendritic cells, 
comprising: 

contacting a composition with cells that comprise dendritic 
cells, whereby viral particles bind to dendritic cells, wherein the 
composition contains a viral particle of any of claims 1 and 3-28 or an 
adenovirus particle that comprises a fiber from Ad37 for targeting the 
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particle to dendritic cells and the adenovirus (Ad) particle, except for the 
fiber, is from a subgroup C adenovirus; and 

infusing the composition into a subject. 

34. The method of claim 33, wherein the cells are removed from 
the subject prior to contacting. 

35. The method of claim 33, wherein the cells comprise immune 

cells. 

36. The method of claim 33, wherein the cells are bone marrow 

cells. 

37. A nucleic acid molecule encoding a viral particle of any of 
claims 1-28. 

38. The nucleic acid molecule of claim 37 that comprises an 
adenovirus vector. 

39. The nucleic acid molecule of claim 37 or claim 38 that 
comprises heterologous nucleic acid. 

40. A cell, comprising the nucleic acid of any of claims 37-39. 

41 . The cell of claim 40 that is a dendritic cell. 

42. A method of treatment, comprising administering a cell to a 
subject who has an immune cell disorder, cancer or an infection, wherein 
the cell is a cell of claim 41 or a dendritic cell containing an adenovirus 
particle that comprises a fiber from Ad37 for targeting the particle to 
dendritic cells and the adenovirus (Ad) particle, except for the fiber, is 
from a subgroup C adenovirus. 

43. The method of claim 32 or 42, wherein the subject is 
infected with a pathogen, has a tumor, an inflammatory disorder, 
allergies, asthma or an autoimmune disease. 

44. A method of targeting an adenovirus particle to dendritic 
cells, comprising replacing all or a portion of the native fiber of the 
adenovirus with an adenovirus subgroup D fiber. 



WO 2004/099422 



PCT/US2004/009219 



-172- 

45. The method of claim 44, wherein: 

the adenovirus (Ad) particle, except for the fiber, is from a 
subgroup C adenovirus; and 

the subgroup D adenovirus is selected from the group 
consisting of adenovirus serotype 8, 9, 10, 13, 15, 17, 19a, 19p, 20, 
22-30, 32, 33, 36, 37, 38, 39 and 42-49. 

46. A method of targeting an adenovirus particle to dendritic 
cells, comprising replacing all or a portion of the native fiber of the 
adenovirus with an adenovirus subgroup B fiber. 

47. The method of claim 46, wherein: 

the adenovirus (Ad) particle, except for the fiber, is from a 
subgroup C adenovirus; and 

the subgroup B adenovirus is selected from the group 
consisting of adenovirus serotype 3, 7, 1 1 , 14, 1 6, 21 , 34, 35, or 50. 

48. The method of claim 45 or claim 47, wherein the subgroup C 
virus is selected from the group consisting of adenovirus serotypes 1, 2, 
5, and 6. 

49. The method of claim 45 or claim 47, wherein the fiber is 
further modified to reduce any interaction with CAR. 

50. The method of claim 49, wherein the fiber is further modified 
to reduce any interaction with heparin sulfate proteoglycans (HSP). 

51 . The method of claim 50 r wherein the capsid includes further 
modifications that alter interaction with a v integrin. 

52. Use of an adenovirus particle for treatment of a disorder or 
disease, wherein the particle is a particle of any of claims 1-28 or is an 
adenovirus particle that comprises a fiber from Ad37 for targeting the 
particle to dendritic cells and the adenovirus (Ad) particle, except for the 
fiber, is from a subgroup C adenovirus. 
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53. The use of claim 52, wherein the disease or disorder is an 
immune cell disorder, cancer or an infection. 

54. Use of an adenovirus particle for preparation of a medicament 
for the treatment of an immune cell disorder, cancer or an infection, 
wherein the particle is a particle of any of claims 1-28 or is an adenovirus 
particle that comprises a fiber from Ad37 for targeting the particle to 
dendritic cells and the adenovirus (Ad) particle, except for the fiber, is 
from a subgroup C adenovirus. 

55. Use of a cell for treatment of disease or disorder selected 
from an immune cell disorder, cancer and an infection, wherein the cell is 
a cell of claim 41 or a dendritic cell containing an adenovirus particle that 
comprises a fiber from Ad37 for targeting the particle to dendritic cells 
and the adenovirus (Ad) particle, except for the fiber, is from a subgroup 
C adenovirus. 

56. The use of claim 55, wherein the disorder is a tumor, an 
inflammatory disorder, allergies, asthma or an autoimmune disease. 

57. Use of a cell for the preparation of a medicament for the 
treatment of a disease or disorder selected from among an immune cell 
disorder, cancer and an infection, wherein the cell is a cell of claim 41 or 
a dendritic cell containing an adenovirus particle that comprises a fiber 
from Ad37 for targeting the particle to dendritic cells and the adenovirus 
(Ad) particle, except for the fiber, is from a subgroup C adenovirus. 

58. The use of claim 57, wherein the disease or disorder is a 
tumor, an inflammatory disorder, allergies, asthma or an autoimmune 
disease. 
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SEQUENCE LISTING 

<110> The Scripps Research Institute 
Von Seggern, Daniel J. 

<12 0> ADENOVIRUS PARTICLES WITH ENHANCED INFECTIVITY OF DENDRITIC CELLS AND 
PARTICLES WITH DECREASED INFECTIVITY OF HEPATOCYTES 

<130> 22908-1239PC 

<14 0> Not yet assigned 
<141> Herewith 

<150> US 60/459, 000 
<151> 2003-03-28 

<150> US 60/467,500 
<151> 2003-05-01 

<160> 118 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1746 

<212> DNA 

<213> Adenovirus type 5 

<220> 

<221> CDS 

<222> (1) . . . (1746) 

<400> 1 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 4 8 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 25 30 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 

aaa atg ggc aac ggc etc tct ctg gac gag gec ggc aac ctt acc tec 240 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

caa aat gta acc act gtg age cca cct etc aaa aaa acc aag tea aac 288 
Gin Asn Val Thr Thr Val Ser Pro Pro Leu Lys Lys Thr Lys Ser Asn 
85 90 ^ 95 

ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gec eta 336 
lie Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 
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act gtg get gec gec gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 
Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gee ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 
130 135 140 

gee acc caa gga ccc etc aca gtg tea gaa gga aag eta gee ctg caa 4 80 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 
165 170 175 

gee tea ccc cct eta act act gee act ggt age ttg ggc att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 
180 185 ^ ~ 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 20 0 2 05 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 768 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

gga gga eta agg att gat tct caa aac aga cgc ctt ata ctt gat gtt 816 
Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gee cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 1 320 

gtt aac eta age act gec aag ggg ttg atg ttt gac get aca gee ata 10 0 8 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 
325 330 335 

gee att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 1056 
Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 350 

aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
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Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 
355 360 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 



age aca ggt gec att aca gta gga aac aaa aat aat gat aag eta act 
Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 ~ 390 395 400 



aaa gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa ata 
Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 
42 0 425 43 0 



gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 



aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 
Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 520 525 



tac att gec caa gaa taa 
Tyr He Ala Gin Glu * 
580 



1200 



ttg tgg acc aca cca get cca tct cct aac tgt aga eta aat gca gag 1248 
Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu 
405 410 415 



1296 



ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 
435 440 445 

tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 13 92 
Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 
450 455 460 



1440 



aga aat gga gat ctt act gaa ggc aca gec tat aca aac get gtt gga 1488 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 495 

ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gec 153 6 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 



1584 



cct gta aca eta acc att aca eta aac ggt aca cag gaa aca gga gac 1632 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
530 535 540 

aca act cca agt gca tac tct atg tea ttt tea tgg gac tgg tct ggc 1680 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 , 555 560 

cac aac tac att aat gaa ata ttt gec aca tec tct tac act ttt tea 1728 

His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 575 



1746 



<210> 2 
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<211> 580 
<212> PRT 

<213> Adenovirus type 5 



<400> 2 



Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu Asp 


1 








5 








Asp 


Thr 


Glu 


Thr 
20 


Gly 


Pro 


Pro 


Thr 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 


Glu 






35 










40 


Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 


Thr 




50 










55 




Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


65 










70 






Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 










85 








Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 


Leu 








100 










Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 






115 










120 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 




130 










135 




Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


145 








150 






Ser 


Gly 


Pro 


Leu 


Thr 


Thr Thr Asp 










165 








Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 


Thr 








180 










Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 


Gly 






195 










200 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 


Leu 




210 










215 




Pro 


Gly 


Val 


Thr 


He 


Asn 


Asn 


Thr 


225 










230 






Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin Gly 










245 








Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 


Asn 








260 










Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 


Gin 






275 










280 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


His 




290 










295 




Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


Ser 


305 










310 






Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 


Leu 










325 








He 


Asn 


Ala 


Gly 


Asp 


Gly 


Leu 


Glu 








340 










Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 


He 






355 










360 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 


Leu 




370 










375 




Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly Asn 


385 










390 






Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 


Pro 










405 








Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu 








420 










Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val 






435 










440 



Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


Tyr 




10 










15 




Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


Phe 


25 










30 






Ser 


Pro 


Pro 


Gly 


Val 


Leu 


Ser 


Leu 










45 








Ser 


Asn 


Gly Met 


Leu 


Ala 


Leu 


Lys 








60 










Glu 


Ala 


Gly Asn 


Leu 


Thr 


Ser 


Gin 






75 










80 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 


He 




90 










95 




Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


Thr 


105 










110 






Val 


Ala 


Gly Asn Thr 


Leu 


Thr 


Met 










125 








His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 


Ala 








140 










Glu Gly Lys 


Leu 


Ala 


Leu 


Gin 


Thr 






155 










160 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 


Ala 




170 










175 




Gly 


Ser 


Leu Gly 


He 


Asp 


Leu 


Lys 


185 










190 






Lys 


Leu Gly Leu 


Lys 


Tyr 


Gly 


Ala 










205 








Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 


Gly 








220 








Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 


Gly 






235 










240 


Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 


Gly 




250 










255 


Arg 


Arg 


Leu 


He 


Leu 


Asp 


Val 


Ser 


265 










270 






Leu 


Asn 


Leu Arg 


Leu 


Gly 


Gin 


Gly 










285 








Asn 


Leu Asp 


He 


Asn 


Tyr 


Asn 


Lys 








300 










Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 


Val 






315 










320 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 


Ala 




330 










335 




Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 


Asn 


345 








350 






Gly 


His 


Gly 


Leu 


Glu 


Phe 


Asp 


Ser 










365 








Gly 


Thr 


Gly 


Leu 


Ser 


Phe 


Asp 


Ser 








380 










Lys 


Asn 


Asn 


Asp 


Lys 


Leu 


Thr 


Leu 






395 










400 


Asn 


Cys 


Arg 


Leu 


Asn 


Ala 


Glu 


Lys 




410 










415 




Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 


Leu 


425 










430 






Lys 


Gly 


Ser 


Leu 


Ala 


Pro 


He 


Ser 



445 
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Gly Thr 


Val 


Gin 


Ser 


Ala 


His 


Leu 


He 


He 


Arg 


Phe 


Asp 


Glu 


Asn 


Gly 




450 










455 










460 










Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 


Arg 


465 










470 










475 










480 


Asn 


Gly 


Asp 


Leu Thr Glu Gly 


Thr 


Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 


Phe 










485 










490 










495 




Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


His 


Gly Lys 


Thr 


Ala 


Lys 








500 










505 










510 






Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 


Pro 






515 










520 










525 








Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn 


Gly Thr 


Gin 


Glu 


Thr 


Gly Asp 


Thr 




530 










535 










540 










Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser 


Gly 


His 


545 










550 










555 










560 


Asn 


Tyr 


He 


Asn 


Glu 


He 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe 


Ser 


Tyr 










565 










570 








575 


He 


Ala 


Gin 


Glu 



























580 



<210> 3 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOI 

<221> CDS 
<222> (1) . . . (1746) 

<400> 3 

atg aag cgc gca aga 
Met Lys Arg Ala Arg 
1 5 

tat gac acg gaa acc 
Tyr Asp Thr Glu Thr 
20 

ttt gta tec ccc aat 
Phe Val Ser Pro Asn 
35 

ttg cgc eta tec gaa 
Leu Arg Leu Ser Glu 
50 

aaa atg ggc aac ggc 
Lys Met Gly Asn Gly 
65 

caa aat gta acc act 
Gin Asn Val Thr Thr 
85 

ata aac ctg gaa ata 
He Asn Leu Glu He 
100 

act gtg get gee gee 



ccg tct gaa gat acc ttc aac ccc gtg tat cca 48 
Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
10 15 

ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
25 30 

ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
40 45 

cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
55 60 

etc tct ctg gac gag gee ggc aac ctt acc tec 240 
Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
70 75 80 

gtg age cca cct etc aaa aaa acc aag tea aac 288 
Val Ser Pro Pro Leu Lys Lys Thr Lys Ser Asn 
90 95 

tct gca ccc etc aca gtt acc tea gaa gee eta 33 6 
Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
105 110 

gca cct eta atg gtc gcg ggc aac aca etc acc 384 
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Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gec ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser He 
130 135 140 

gee acc caa gga ccc etc aca gtg tea gaa gga aag eta gee ctg caa 48 0 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 
165 170 175 

gec tea ccc cct eta act act gee act ggt age ttg gge att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 
180 185 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 2 00 2 05 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

gga gec ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 768 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

gg a gga eta agg att gat tct caa aac aga cgc ctt ata ctt gat gtt 816 
Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gec cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 

gtt aac eta age act gec aag ggg ttg atg ttt gac get aca gee ata 10 0 8 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 
325 330 ' 335 

gec att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 1056 
Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 ~ 350 

aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 
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355 360 365 

1152 



tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 

age aca ggt gee att aca gta gga aac aaa aat aat gat aag eta act 
Ser Thr Gly Ala lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 ^ 390 395 400 



aaa gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa ata 
Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie 
420 425 430 



gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp. Asn Phe 
465 470 475 480 

aga aat gga gat ctt act gaa ggc aca gec tat aca aac get gtt gga 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 495 



aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 
Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 520 525 



tac att gec caa gaa taa 
Tyr He Ala Gin Glu * 
580 



1200 



ttg tgg ace aca cca get cca gag get aac tgt aga eta aat gca gag 124 8 
Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 
405 410 415 



1296 



ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 
435 440 445 

tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 13 92 
Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 
450 455 460 



1440 



1488 



ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gee 1536 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 



1584 



cct gta aca eta acc att aca eta aac ggt aca cag gaa aca gga gac 1632 
Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
530 535 540 

aca act cca agt gca tac tct atg tea ttt tea tgg gac tgg tct ggc 168 0 
Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

cac aac tac att aat gaa ata ttt gee aca tec tct tac act ttt tea 172 8 
His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 
565 570 575 



1746 



<210> 4 
<211> 581 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOI 

<400> 4 _, m 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

15 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

20 " 25 30 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 40 45 

Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Lys Lys Thr Lys Ser Asn 

85 90 95 

lie Asn Leu Glu He Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 

100 105 HO 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 

115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser He 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 " 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 

180 185 190 

Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 " 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 ^ 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 ~ 33 0 3 35 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 J 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 
420 425 ' 430 
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Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val Lys 


Gly 


Ser Leu 


Ala 


Pro 


He 






435 










440 


445 








Ser 


Gly 
450 


Thr 


Val 


Gin 


Ser 


Ala 
455 


His 


Leu He 


He 


Arg Phe 
460 


Asp 


Glu 


Asn 


Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu Asp 


Pro 


Glu Tyr 


Trp 


Asn 


Phe 


465 










470 








475 








480 


Arg 


Asn 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly Thr Ala 


Tyr 


Thr Asn 


Ala 


Val 


Gly 










485 








490 








495 




Phe 


Met 


Pro 


Asn 
500 


Leu 


Ser 


Ala 


Tyr 


Pro Lys 
505 


Ser 


His Gly 


Lys 
510 


Thr 


Ala 


Lys 


Ser 


Asn 
515 


He 


Val 


Ser 


Gin 


Val 
520 


Tyr Leu 


Asn 


Gly Asp 
525 


Lys 


Thr 


Lys 


Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu Asn Gly 


Thr 


Gin Glu Thr Gly 


Asp 




530 










535 








540 








Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


Ser Phe 


Ser 


Trp Asp 


Trp 


Ser 


Gly 


545 










550 








555 








560 


His 


Asn 


Tyr 


He 


Asn 
565 


Glu 


He 


Phe 


Ala Thr 
570 


Ser 


Ser Tyr 


Thr 


Phe 
575 


Ser 


Tyr 


He 


Ala 


Gin 
580 


Glu 





















<210> 5 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOIRGD 

<221> CDS 

<222> (1) . . . (1746) 

<400> 5 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 4 8 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 25 30 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 

aaa atg ggc aac ggc etc tct ctg gac gag gee ggc aac ctt acc tec 24 0 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

caa aat gta acc act gtg age cca cct etc aaa aaa acc aag tea aac 2 88 
Gin Asn Val Thr Thr Val Ser Pro Pro Leu Lys Lys Thr Lys Ser Asn 
85 90 95 

ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gee eta 33 6 
He Asn Leu Glu He Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 
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act gtg get gec gec gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 
Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gec ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 
130 135 140 

gee acc caa gga ccc etc aca gtg tea gaa gga aag eta gee ctg caa 4 80 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 
165 170 175 

gee tea ccc cct eta act act gee act ggt age ttg ggc att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 
180 185 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 ~ 200 205 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 76 8 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

99 a 99 a cta a 99 att 9 at tct caa aac a 9 a C 9 C ctt ata ctt 9 at 9 fct 816 
Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 
260 265 ~ 270 

agt tat ccg ttt gat get caa aac caa cta aat cta aga cta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gee cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 96 0 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 " 310 315 320 

gtt aac cta age act gee aag ggg ttg atg ttt gac get aca gee ata 10 0 8 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 
325 330 335 

gec att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 10 56 
Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 350 
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aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 
355 360 ~ 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 

age aca ggt gee att aca gta gga aac aaa aat aat gat aag eta act 12 0 0 
Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 " 400 

ttg tgg ace aca cca get cca tct cct aac tgt aga eta aat gca gag 124 8 
Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu 
405 410 415 

aaa gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa ata 12 96 
Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 
420 425 ~ 430 

ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro lie 
435 440 " 445 

tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 1392 
Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 
450 455 460 

gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 1440 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 ' ~ 480 

aga aat gga gat ctt act gaa ggc aca gee tat aca aac get gtt gga 1488 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 495 

ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gee 1536 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 

aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 1584 
Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 520 525 

cct gta aca eta ace att aca eta aac ggt aca cag gaa aca ggt gat 1632 
Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
53 0 535 540 

cat tgt gat tgt cgt ggt gat tgt ttt tgt aca act cca agt gca tac 16 80 
His Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Thr Pro Ser Ala Tyr 
545 550 555 560 

tct atg tea ttt tea tgg gac tgg tct ggc cac aac tac att aat gaa 172 8 
Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu 
565 570 575 

ata ttt gee aca tec tct tacacttttt catacattgc ccaagaataa 1776 
He Phe Ala Thr Ser Ser 
580 
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<211> 582 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOIRGD 



<400> 6 



Met 
1 


Lys 


Arg 


Ala 


Arg 
5 


Pro 


Ser 


Glu 


Tyr 


Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 








20 










Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 






35 










40 


Leu 


Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 




50 










55 




Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


65 










70 






Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 










85 








He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 








100 










Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 






115 










120 


Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 




130 










135 




Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


145 








150 






Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 










165 








Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 








180 










Lys 


Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 






195 










200 


Ala 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 




210 










215 




Gly 


Pro 


Gly 


Val 


Thr 


He 


Asn 


Asn 


225 










230 






Gly Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin 










245 








Gly Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 








260 










Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 






275 








280 


Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 




290 










295 




Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


305 










310 






Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 










325 








Ala 


He 


Asn 


Ala 


Gly 


Asp 


Gly 


Leu 








340 










Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 






355 










360 


Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 




370 










375 




Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly 


385 










390 






Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 








405 
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Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 




10 










15 




Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


25 










30 






Glu 


Ser 


Pro 


Pro 


Gly 


Val 


Leu 


Ser 










45 








Thr 


Ser 


Asn 


Gly Met 


Leu 


Ala 


Leu 








60 










Asp 


Glu 


Ala 


Gly Asn 


Leu 


Thr 


Ser 






75 










80 


Pro 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 




90 










95 




Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


105 










110 






Met 


Val 


Ala 


Gly Asn Thr Leu 


Thr 










125 








Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 








140 










Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 






155 










160 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 




170 










175 




Thr 


Gly 


Ser 


Leu 


Gly 


He 


Asp 


Leu 


185 










190 






Gly Lys 


Leu 


Gly 


Leu 


Lys 


Tyr 


Gly 










205 








Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 








220 










Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 






235 










240 


Gly Asn Met 


Gin 


Leu 


Asn 


Val 


Ala 




250 










255 




Asn Arg Arg 


Leu 


He 


Leu Asp 


Val 


265 










270 






Gin 


Leu 


Asn 


Leu Arg 


Leu 


Gly 


Gin 










285 






His 


Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 








300 










Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 






315 










320 


Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 




330 










335 




Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 


345 










350 






He 


Gly His 


Gly 


Leu 


Glu 


Phe 


Asp 










365 








Leu 


Gly 


Thr 


Gly 


Leu 


Ser 


Phe 


Asp 








380 










Asn 


Lys 


Asn 


Asn Asp 


Lys 


Leu 


Thr 






395 










400 


Pro 


Asn 


Cys 


Arg 


Leu 


Asn 


Ala 


Glu 




410 






415 
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Lys 


Asp 


Ala 


Lys 








420 


Leu 


Ala 


Thr 


Val 






435 




Sex* 


("2.1 \r 


Thr 


V d_L 




450 






Gly 


Val 


Leu 


Leu 


465 








Arg 


Asn 


Gly 


Asp 


Phe 


Met 


Pro 


Asn 








500 


Lys 


Ser 


Asn 


He 






515 




Pro 


Val 


Thr 


Leu 




530 






His 


Cys 


Asp 


Cys 


545 








Ser 


Met 


Ser 


Phe 


lie 


Phe 


Ala 


Thr 








580 



Leu 


Thr 


Leu 


Val 


Ser 


Val 


Leu 


Ala 








440 


Gin 


Ser 


Ala 


His 






455 




Asn 


Asn 


Ser 


Phe 




470 






Leu 


Thr 


Glu 


Gly 


485 








Leu 


Ser 


Ala 


Tyr 


Val 


Ser 


Gin 


Val 








520 


Thr 


He 


Thr 


Leu 






535 




Arg 


Gly 


Asp 


Cys 




550 






Ser 


Trp 


Asp 


Trp 


565 








Ser 


Ser 







Leu Thr Lys Cys 
425 

Val Lys Gly Ser 

Leu lie He Arg 
460 

Leu Asp Pro Glu 
475 

Thr Ala Tyr Thr 
490 

Pro Lys Ser His 
505 

Tyr Leu Asn Gly 

Asn Gly Thr Gin 
540 

Phe Cys Thr Thr 
555 

Ser Gly His Asn 
570 



Glv 


Ser 


Gin 


He 




430 






Leu 


Ala 


Pro 


He 


445 








Phe 


Asp 


Glu 


Asn 


Tyr 


Trp 


Asn 


Phe 








480 


Asn 


Ala 


Val 


Gly 






495 




Gly 


Lys 


Thr 


Ala 




510 






Asp 


Lys 


Thr 


Lys 


525 








Glu 


Thr 


Gly 


Asp 


Pro 


Ser 


Ala 


Tyr 








560 


Tyr 


He 


Asn 


Glu 






575 





<210> 7 
<211> 1746 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> 5F K012 

<221> CDS 

<222> (1) . . . (1746) 

<400> 7 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 4 8 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 25 30 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 

aaa atg ggc aac ggc etc tct ctg gac gag gee ggc aac ctt acc tec 24 0 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

caa aat gta acc act gtg age cca cct etc aaa aaa acc aag tea aac 2 88 
Gin Asn Val Thr Thr Val Ser Pro Pro Leu Lys Lys Thr Lys Ser Asn 
85 90 95 

ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gee eta 33 6 
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Ile Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 

act gtg get gec gec gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 
Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gec ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser He 
130 135 140 

gee acc caa gga ccc etc aca gtg tea gaa gga aag eta gee ctg caa 480 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 
165 170 175 

gee tea ccc cct eta act act gec act ggt age ttg ggc att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 
180 185 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 200 205 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 " 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 768 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

99 a 99 a cta a 99 a ^t gat tct caa aac aga cgc ctt ata ctt gat gtt 816 
Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gec cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 ~ ~ 310 315 320 

gtt aac eta age act gee aag ggg ttg atg ttt gac get aca gee ata 10 08 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 
325 330 335 

gee att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 1056 
Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
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340 345 350 

aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 
355 360 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 

age aca ggt gee att aca gta gga aac aaa aat aat gat aag eta act 12 00 
Ser Thr Gly Ala lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

ttg tgg acc aca cca get cca tct cct aac tgt tea eta aat gga ggc 1248 
Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Ser Leu Asn Gly Gly 
405 410 415 

gga gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa ata 12 96 
Gly Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie 
420 425 * 430 

ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 
435 440 445 

tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 13 92 
Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 
450 455 460 

gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 1440 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

aga aat gga gat ctt act gaa ggc aca gec tat aca aac get gtt gga 14 88 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 495 

ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gec 1536 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 

aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 15 84 
Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 520 525 

cct gta aca eta acc att aca eta aac ggt aca cag gaa aca gga gac 1632 
Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
530 535 4 540 

aca act cca agt gca tac tct atg tea ttt tea tgg gac tgg tct ggc 1680 
Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

cac aac tac att aat gaa ata ttt gee aca tec tct tac act ttt tea 1728 
His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 
565 570 ^ 575 

tac att gee caa gaa taa 1746 
Tyr He Ala Gin Glu * 
580 
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<210> 8 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F K012 



<400> 8 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


1 








5 








T y r 


Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 








20 










Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 






35 










40 


Leu 


Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 




50 










55 




Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


65 










70 






Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 










85 








He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 








100 










Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 






115 










120 


Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 




130 










135 




Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


145 










150 






Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 










165 








Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 








180 










Lys 


Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 






195 










200 


Ala 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 




210 










215 




Gly Pro 


Gly 


Val 


Thr 


He 


Asn 


Asn 


225 










230 






Gly Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin 










245 








Gly Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 








260 










Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 






275 










280 


Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


290 










295 




Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


305 










310 






Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 










325 








Ala 


He 


Asn 


Ala 


Gly 


Asp 


Gly 


Leu 








340 










Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 






355 










360 


Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 




370 










375 




Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly 


385 










390 







-16- 



Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 




10 










15 




Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


25 










30 






Glu 


Ser 


Pro 


Pro 


Gly Val 


Leu 


Ser 










45 








Thr 


Ser 


Asn 


Gly Met 


Leu 


Ala 


Leu 








60 










Asp 


Glu 


Ala 


Gly Asn 


Leu 


Thr 


Ser 






75 










80 


Pro 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 




90 










95 




Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


105 










110 






Met 


Val 


Ala 


Gly Asn Thr 


Leu 


Thr 










125 








Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 








140 










Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 






155 










160 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 




170 










175 




Thr 


Gly 


Ser 


Leu Gly 


He 


Asp 


Leu 


185 










190 






Gly Lys 


Leu 


Gly Leu 


Lys 


Tyr 


Gly 










205 








Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 








220 










Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 






235 










240 


Gly Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 




250 










255 




Asn Arg 


Arg 


Leu 


He 


Leu Asp 


Val 


265 










270 






Gin 


Leu 


Asn 


Leu Arg Leu Gly 


Gin 










285 








His 


Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 








300 










Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 






315 










320 


Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 




330 










335 




Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 


345 










350 






He 


Gly 


His 


Gly 


Leu 


Glu 


Phe 


Asp 










365 








Leu 


Gly 


Thr 


Gly 


Leu 


Ser 


Phe 


Asp 








380 










Asn 




Asn 


Asn Asp 


Lys 


Leu 


Thr 






395 










400 
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Leu 


Trp 


Thr 


Thr 


Gly Asp 


Ala 


Lys 








420 


Leu 


Ala 


Thr 


Val 






435 




Ser 


Gly 


± I1X 


V cL-L 




450 






Gly Val 


Leu 


Leu 


465 








Arg 


Asn 


Gly 


Asp 


Phe 


Met 


Pro 


Asn 








500 


Lys 


Ser 


Asn 


He 






515 




Pro 


Val 


Thr 


Leu 




530 






Thr 


Thr 


Pro 


Ser 


545 








His 


Asn 


Tyr 


He 


Tyr 


He 


Ala 


Gin 








580 



Pro Ala Pro Ser 
405 

Leu Thr Leu Val 

Ser Val Leu Ala 
440 

Gin Ser Ala His 
455 

Asn Asn Ser Phe 
470 

Leu Thr Glu Gly 
485 

Leu Ser Ala Tyr 

Val Ser Gin Val 
520 

Thr He Thr Leu 
535 

Ala Tyr Ser Met 
550 

Asn Glu He Phe 

565 

Glu 



Pro Asn Cys Ser 
410 

Leu Thr Lys Cys 
425 

Val Lys Gly Ser 

Leu He He Arg 
460 

Leu Asp Pro Glu 
475 

Thr Ala Tyr Thr 
490 

Pro Lys Ser His 
505 

Tyr Leu Asn Gly 

Asn Gly Thr Gin 
540 

Ser Phe Ser Trp 
555 

Ala Thr Ser Ser 
570 



Leu Asn Gly Gly 
415 

Gly Ser Gin He 
430 

Leu Ala Pro He 
445 

Phe Asp Glu Asn 

Tyr Trp Asn Phe 
480 

Asn Ala Val Gly 
495 

Gly Lys Thr Ala 
510 

Asp Lys Thr Lys 
525 

Glu Thr Gly Asp 

Asp Trp Ser Gly 
560 

Tyr Thr Phe Ser 
575 



<210> 9 
<211> 1686 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F S* 

<221> CDS 

<222> (1) . . . (1746) 

<400> 9 

acc ggt cct cca act gtg cct ttt ctt act cct ccc ttt gta tec ccc 48 

Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro Phe Val Ser Pro 

15 10 15 

aat ggg ttt caa gag agt ccc cct ggg gta etc tct ttg cgc eta tec 96 
Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser Leu Arg Leu Ser 
20 25 30 

gaa cct eta gtt acc tec aat ggc atg ctt gcg etc aaa atg ggc aac 144 
Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu Lys Met Gly Asn 
35 40 45 

ggc etc tct ctg gac gag gee ggc aac ctt acc tec caa aat gta acc 192 
Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser Gin Asn Val Thr 
50 55 60 

act gtg age cca cct etc gga gee gga gee tea aac ata aac ctg gaa 24 0 
Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn He Asn Leu Glu 
65 70 75 80 

ata tct gca ccc etc aca gtt acc tea gaa gee eta act gtg get gee 28 8 
He Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu Thr Val Ala Ala 
85 90 95 
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gcc gca cct eta atg gtc gcg ggc aac aca etc acc atg caa tea cag 33 6 
Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr Met Gin Ser Gin 
100 105 110 

gec ccg eta acc gtg cac gac tec aaa ctt age att gee acc caa gga 3 84 
Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie Ala Thr Gin Gly 
115 120 125 

ccc etc aca gtg tea gaa gga aag eta gee ctg caa aca tea ggc ccc 432 
Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin Thr Ser Gly Pro 
130 135 " 140 

etc acc acc acc gat age agt acc ctt act ate act gee tea ccc cct 480 
Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr Ala Ser Pro Pro 
145 150 155 160 

eta act act gee act ggt age ttg ggc att gac ttg aaa gag ccc att 52 8 
Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu Lys Glu Pro lie 
165 170 175 

tat aca caa aat gga aaa eta gga eta aag tac ggg get cct ttg cat 576 
Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly Ala Pro Leu His 
180 185 190 

gta aca gac gac eta aac act ttg acc gta gca act ' ggt cca ggt gtg 624 
Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr Gly Pro Gly Val 
195 200 205 

act att aat aat act tec ttg caa act aaa gtt act gga gee ttg ggt 672 
Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr Gly Ala Leu Gly 
210 215 220 

ttt gat tea caa ggc aat atg caa ctt aat gta gca gga gga eta agg 72 0 
Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala Gly Gly Leu Arg 
225 230 235 240 

att gat tct caa aac aga cgc ctt ata ctt gat gtt agt tat ccg ttt 76 8 
lie Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val Ser Tyr Pro Phe 
245 250 255 

gat get caa aac caa eta aat eta aga eta gga cag ggc cct ctt ttt 816 
Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin Gly Pro Leu Phe 
260 265 270 

ata aac tea gee cac aac ttg gat att aac tac aac aaa ggc ctt tac 864 
lie Asn Ser Ala His Asn Leu Asp lie Asn Tyr Asn Lys Gly Leu Tyr 
275 280 285 

ttg ttt aca get tea aac aat tec aaa aag ctt gag gtt aac eta age 912 
Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu Val Asn Leu Ser 
290 295 300 

act gee aag ggg ttg atg ttt gac get aca gee ata gee att aat gca 96 0 
Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala lie Ala He Asn Ala 
305 310 315 320 

gga gat ggg ctt gaa ttt ggt tea cct aat gca cca aac aca aat ccc 10 08 
Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro Asn Thr Asn Pro 
325 330 335 
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ctc aaa aca aaa atfc ggc cat ggc eta gaa ttt gat tea aac aag get 
Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp Ser Asn Lys Ala 
340 345 350 



1056 



atg gtt cct aaa eta gga act ggc ctt agt ttt gac age aca ggt gee 
Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp Ser Thr Gly Ala 
355 360 365 



1104 



att aca gta gga aac aaa aat aat gat aag eta act ttg tgg ace aca 
lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr Leu Trp Thr Thr 
370 375 380 



1152 



cca get cca tct cct aac tgt aga eta aat gca gag aaa gat get aaa 
Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu Lys Asp Ala Lys 
385 390 " 395 400 



1200 



etc act ttg gtc tta aca aaa tgt ggc agt caa ata ctt get aca gtt 
Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie Leu Ala Thr Val 
405 410 415 



1248 



tea gtt ttg get gtt aaa ggc agt ttg get cca ata tct gga aca gtt 
Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro lie Ser Gly Thr Val 
420 425 430 



1296 



caa agt get cat ctt att ata aga ttt gac gaa aat gga gtg eta eta 
Gin Ser Ala His Leu lie lie Arg Phe Asp Glu Asn Gly Val Leu Leu 
435 440 ~ 445 



1344 



aac aat tec ttc ctg gac cca gaa tat tgg aac ttt aga aat gga gat 
Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe Arg Asn Gly Asp 
450 ~ 455 460 



1392 



ctt act gaa ggc aca gee tat aca aac get gtt gga ttt atg cct aac 
Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly Phe Met Pro Asn 
465 470 475 480 



1440 



eta tea get tat cca aaa tct cac ggt aaa act gee aaa agt aac att 
Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala Lys Ser Asn lie 
485 490 495 



1488 



gtc agt caa gtt tac tta aac gga gac aaa act aaa cct gta aca eta 
Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys Pro Val Thr Leu 
500 505 510 



1536 



acc att aca eta aac ggt aca cag gaa aca gga gac aca act cca agt 
Thr lie Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp Thr Thr Pro Ser 
515 520 525 



1584 



gca tac tct atg tea ttt tea tgg gac tgg tct ggc cac aac tac att 
Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr lie 
530 535 540 



1632 



aat gaa ata ttt gee aca tec tct tac act ttt tea tac att gee caa 
Asn Glu lie Phe Ala Thr Ser Ser Tyr Thr Phe Ser Tyr lie Ala Gin 
545 550 555 " 560 



1680 



gaa taa 
Glu * 



1686 
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<210> 10 
<211> 561 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F S* 



<400> 10 



Thr 


Gly 


Pro 


Pro 


Thr 


Val 


Pro 


Phe 


1 








5 








Asn 


Gly 


Phe 


Gin 


Glu 


Ser 


Pro 


Pro 








20 










Glu 


Pro 


Leu 


Val 


Thr 


Ser 


Asn 


Gly 






35 










40 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala 


Gly 




50 










55 




Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Gly 


Ala 


65 










70 




He 


Ser 


Ala 


Pro 


Leu 


Thr 


Val 


Thr 










85 








Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala 


Gly 








100 










Ala 


Pro 


Leu 


Thr 


Val 


His 


Asp 


Ser 






115 








120 


Pro 


Leu 


Thr 


Val 


Ser 


Glu 


Gly 


Lys 




130 










135 




Leu 


Thr 


Thr 


Thr 


Asp 


Ser 


Ser 


Thr 


145 










150 






Leu 


Thr 


Thr 


Ala 


Thr 


Gly 


Ser 


Leu 










165 






Tyr 


Thr 


Gin 


Asn 


Gly 


Lys 


Leu 


Gly 








180 










Val 


Thr 


Asp 


Asp 


Leu 


Asn 


Thr 


Leu 






195 










200 


Thr 


He 


Asn 


Asn 


Thr 


Ser 


Leu 


Gin 




210 










215 




Phe 


Asp 


Ser 


Gin 


Gly 


Asn 


Met 


Gin 


225 










230 






He 


Asp 


Ser 


Gin 


Asn 


Arg 


Arg 


Leu 










245 








Asp 


Ala 


Gin 


Asn 


Gin 


Leu 


Asn 


Leu 








260 










He 


Asn 


Ser 


Ala 


His 


Asn 


Leu 


Asp 






275 










280 


Leu 


Phe 


Thr 


Ala 


Ser 


Asn 


Asn 


Ser 




290 










295 




Thr 


Ala 


Lys 


Gly 


Leu 


Met 


Phe 


Asp 


305 










310 






Gly 


Asp 


Gly 


Leu 


Glu 


Phe 


Gly 


Ser 










325 








Leu 


Lys 


Thr 


Lys 


He 


Gly 


His 


Gly 








340 










Met 


Val 


Pro 


Lys 


Leu 


Gly 


Thr 


Gly 






355 










360 


He 


Thr 


Val 


Gly 


Asn 


Lys 


Asn 


Asn 




370 










375 
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Leu 


Thr 


Pro 


Pro 


Phe 


Val 


Ser 


Pro 




10 










15 




Gly Val 


Leu 


Ser 


Leu 


Arg 


Leu 


Ser 


25 










30 






Met 


Leu 


Ala 


Leu 


Lys 


Met 


Gly 


Asn 










45 








Asn 


Leu 


Thr 


Ser 


Gin 


Asn 


Val 


Thr 








60 










Gly Ala 


Ser 


Asn 


He 


Asn 


Leu 


Glu 






75 










80 


Ser 


Glu 


Ala 


Leu 


Thr 


Val 


Ala 


Ala 




90 










95 




Asn 


Thr 


Leu 


Thr 


Met 


Gin 


Ser 


Gin 


105 










110 






Lys 


Leu 


Ser 


He 


Ala 


Thr 


Gin 


Gly 










125 






Leu 


Ala 


Leu 


Gin 


Thr 


Ser 


Gly 


Pro 








140 








Leu 


Thr 


He 


Thr 


Ala 


Ser 


Pro 


Pro 






155 










160 


Gly 


He 


Asp 


Leu 


Lys 


Glu 


Pro 


He 




170 










175 




Leu 


Lys 


Tyr 


Gly Ala 


Pro 


Leu 


His 


185 










190 






Thr 


Val 


Ala 


Thr 


Gly 


Pro 


Gly 


Val 










205 








Thr 


Lys 


Val 


Thr Gly Ala 


Leu 


Gly 








220 










Leu 


Asn 


Val 


Ala Gly Gly 


Leu 


Arg 






235 










240 


He 


Leu 


Asp 


Val 


Ser 


Tyr 


Pro 


Phe 




250 










255 




Arg 


Leu 


Gly 


Gin 


Gly 


Pro 


Leu 


Phe 


265 










270 






He 


Asn 


Tyr 


Asn 


Lys 


Gly 


Leu 


Tyr 










285 








Lys 


Lys 


Leu 


Glu 


Val 


Asn 


Leu 


Ser 








300 










Ala 


Thr 


Ala 


He 


Ala 


He 


Asn 


Ala 






315 










320 


Pro 


Asn 


Ala 


Pro 


Asn 


Thr 


Asn 


Pro 




330 










335 




Leu 


Glu 


Phe 


Asp 


Ser 


Asn 


Lys 


Ala 


345 










350 






Leu 


Ser 


Phe 


Asp 


Ser 


Thr 


Gly 


Ala 










365 








Asp 


Lys 


Leu 


Thr 


Leu 


Trp 


Thr 


Thr 



380 
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Pro 


Ala 


Pro 


Ser 


Pro 


Asn 


Cys 


Arg 


Leu 


Asn 


Ala 


Glu 


Lys 


Asp 


Ala 


Lys 


385 










390 








395 










400 


Leu 


Thr 


Leu 


Val 


Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 


Leu 


Ala 


Thr 


Val 










405 








410 










415 




Ser 


Val 


Leu 


Ala 


Val 


Lys 


Gly 


Ser 


Leu 


Ala 


Pro 


He 


Ser 


Gly 


Thr 


Val 








420 






425 










430 






Gin 


Ser 


Ala 


His 


Leu 


He 


He 


Arg 


Phe 


Asp 


Glu 


Asn 


Gly Val 


Leu 


Leu 






435 










440 










445 








Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 


Arg Asn Gly Asp 




450 










455 










460 










Leu 


Thr 


Glu 


Gly Thr Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 


Phe 


Met 


Pro 


Asn 


465 










470 










475 










480 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


His 


Gly Lys 


Thr 


Ala 


Lys 


Ser 


Asn 


He 










485 










490 










495 




Val 


Ser 


Gin 


Val 


Tyr 


Leu Asn Gly 


Asp 


Lys 


Thr 


Lys 


Pro 


Val 


Thr 


Leu 








500 










505 










510 






Thr 


He 


Thr 


Leu Asn Gly Thr 


Gin 


Glu Thr Gly Asp 


Thr 


Thr 


Pro 


Ser 






515 










520 










525 








Ala 


Tyr 
530 


Ser 


Met 


Ser 


Phe 


Ser 
535 


Trp 


Asp 


Trp 


Ser 


Gly 
540 


His 


Asn 


Tyr 


He 


Asn 


Glu 


He 


Phe 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe 


Ser 


Tyr 


He 


Ala 


Gin 


545 










550 










555 










560 


Glu 

































<210> 11 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F S*RGD 

<221> CDS 

<222> (1) . . . (1746) 

<400> 11 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 48 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 25 30 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 

aaa atg ggc aac ggc etc tct ctg gac gag gec ggc aac ctt acc tec 24 0 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

caa aat gta acc act gtg age cca cct etc gga gec gga gec tea aac 2 88 
Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala- Gly Ala Ser Asn 
85 90 95 
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ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gec eta 336 
lie Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 

act gtg get gee gee gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 
Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gee ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser He 
130 135 140 

gee acc caa gga ccc etc aca gtg tea gaa gga aag eta gee ctg caa 480 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 
165 170 175 

gee tea ccc cct eta act act gee act ggt age ttg ggc att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 
180 185 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 ~ 200 205 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 768 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

gga- 993- eta agg att gat tct caa aac aga cgc ctt ata ctt gat gtt 816 
Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gee cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 " 310 315 320 

gtt aac eta age act gee aag ggg ttg atg ttt gac get aca gee ata 1008 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 
325 330 335 
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gcc att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 
Ala lie Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 350 



1056 



aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 
355 360 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 



age aca ggt gee att aca gta gga aac aaa aat aat gat aag eta act 12 0 0 
Ser Thr Gly Ala lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

ttg tgg ace aca cca get cca tct cct aac tgt aga eta aat gca gag 124 8 
Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu 
405 410 415 

aaa gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa ata 1296 
Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie 
420 425 430 

ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro lie 
435 440 ~~ 445 



tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 
Ser Gly Thr Val Gin Ser Ala His Leu lie lie Arg Phe Asp Glu Asn 
450 455 460 



1392 



gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 



1440 



aga aat gga gat ctt act gaa ggc aca gee tat aca aac get gtt gga 14 88 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 ' 495 

ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gee 1536 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 

aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 15 84 
Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 52 0 525 

cct gta aca eta acc att aca eta aac ggt aca cag gaa aca ggt gat 1632 
Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
530 535 540 

cat tgt gat tgt cgt ggt gat tgt ttt tgt aca act cca agt gca tac 1680 
His Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Thr Pro Ser Ala Tyr 
545 550 555 560 

tct atg tea ttt tea tgg gac tgg tct ggc cac aac tac att aat gaa 1728 
Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu 
565 570 575 

ata ttt gee aca tec tct tacacttttt catacattgc ccaagaataa 1776 



WO 2004/099422 



PCT/US2004/009219 
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580 



<210> 12 
<211> 582 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F S*RGD 



<400> 12 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


1 








5 








Tyr Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 








20 










Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 






35 










40 


Leu Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 




50 










55 




Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


65 










70 






Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 










85 








He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 








100 










Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 






115 










120 


Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 




130 










135 




Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


145 










150 






Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 










165 








Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 








180 










Lys 


Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 






195 










200 


Ala 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 




210 










215 




Gly 


Pro 


Gly 


Val 


Thr 


He 


Asn 


Asn 


225 










230 






Gly Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin 










245 








Gly Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 








260 










Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 






275 










280 


Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 




290 










295 




Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


305 










310 






Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 










325 








Ala 


He 


Asn 


Ala 


Gly 


Asp 


Gly 


Leu 








340 










Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 






355 










360 


Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 



-24- 





Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 




10 










15 




Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


25 










30 






Giu 


Ser 


Pro 


Pro 


Gly Val 


Leu 


Ser 










45 








Thr 


Ser Asn Gly Met 


Leu 


Ala 


Leu 








60 










Asp 


Glu 


Ala 


Gly Asn Leu 


Thr 


Ser 






75 










80 


Pro 


Leu Gly Ala Gly Ala 


Ser 


Asn 




90 










95 




Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


105 










110 






Met 


Val 


Ala Gly Asn Thr 


Leu 


Thr 










125 








Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 








140 










Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 






155 










160 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 




170 










175 




Thr 


Gly 


Ser 


Leu 


Gly 


He 


Asp 


Leu 


185 










190 






Gly 


Lys 


Leu 


Gly 


Leu 


Lys 


Tyr 


Gly 










205 








Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 








220 










Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 






235 










240 


Gly Asn Met 


Gin 


Leu 


Asn 


Val 


Ala 




250 










255 




Asn 


Arg Arg 


Leu 


He 


Leu 


Asp 


Val 


265 










270 






Gin 


Leu 


Asn 


Leu Arg 


Leu 


Gly 


Gin 










285 








His 


Asn 


Leu Asp 


He 


Asn 


Tyr 


Asn 








300 










Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 






315 










320 


Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 




330 










335 




Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 


345 










350 






He 


Gly His 


Gly Leu Glu 


Phe 


Asp 










365 








Leu 


Gly Thr Gly 


Leu 


Ser 


Phe 


Asp 
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370 375 380 



Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly Asn Lys Asn Asn Asp 


Lys 


Leu 


Thr 


385 








390 










395 










400 


Leu. 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 


Pro 


Asn 


cys 


Arg 


Leu 


Asn 


Ala 


Glu 








405 










410 










415 




Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 




420 










425 










430 






Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val 


Lys 


Gly 


Ser 


Leu 


Ala 


Pro 


He 






435 










440 










445 








Ser 


Gly 


Thr 


Val 


Gin 


Ser 


Ala 


His 


Leu 


He 


He 


Arg 


Phe 


Asp 


Glu 


Asn 




450 










455 










460 










Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 


465 










470 










475 










480 


Arg 


Asn 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly 


Thr 


Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 




485 










490 










495 




Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


His 


Gly 


Lys 


Thr 


Ala 








500 










505 










510 






Lys 


Ser 


Asn 


lie 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 


Gly Asp 


Lys 


Thr 


Lys 




515 










520 










525 








Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu Asn Gly Thr Gin Glu 


Thr 


Gly 


As P 




530 










535 










540 










His 


Cys 


Asp 


Cys 


Arg 


Gly 


Asp 


Cys 


Phe 


Cys 


Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


545 








550 










555 










560 


Ser 


Met 


Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser Gly His 


Asn 


Tyr 


He 


Asn 


Glu 










565 










570 










575 




lie 


Phe 


Ala 


Thr 


Ser 


Ser 























580 



<210> 13 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F K01S* 

<221> CDS 

<222> (1) . . . (1746) 

<400> 13 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 48 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
1 ' 5 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 * 25 3 0 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 

aaa atg ggc aac ggc etc tct ctg gac gag gee ggc aac ctt acc tec 240 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 " 70 75 80 
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caa aat gta acc act gtg age cca cct etc gga gec gga gec tea aac 2 88 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 
85 90 95 

ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gec eta 33 6 

lie Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 

act gtg get gec gec gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gec ccg eta acc gtg cac gac tec aaa ctt age att 432 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 
130 135 140 

gec acc caa gga ccc etc aca gtg tea gaa gga aag eta gee ctg caa 480 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 

145 ~ 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 
165 ~ 170 175 

gee tea ccc cct eta act act gec act ggt age ttg ggc att gac ttg 576 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 
180 185 " 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 200 205 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 

Gly Pro Gly Val Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 

225 * 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 768 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

QQci eta agg att gat tct caa aac aga cgc ctt ata ctt gat gtt 816 

Gly Gly Leu Arg lie Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gec cac aac ttg gat att aac tac aac 912 

Gly Pro Leu Phe lie Asn Ser Ala His Asn Leu Asp lie Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 

305 310 315 320 

gtt aac eta age act gee aag ggg ttg atg ttt gac get aca gee ata 10 0 8 
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Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala lie 
325 330 335 

gcc att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 1056 
Ala lie Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 . 345 350 

aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 
355 " 360 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 

age aca ggt gcc att aca gta gga aac aaa aat aat gat aag eta act 12 0 0 
Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

ttg tgg acc aca cca get cca gag get aac tgt aga eta aat gca gag 1248 
Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 
405 410 415 

aaa gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa at a 12 96 
Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 
420 425 430 

ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 
435 440 445 

tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 13 92 
Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 
450 455 460 

gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 144 0 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

aga aat gga gat ctt act gaa ggc aca gcc tat aca aac get gtt gga 14 88 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 495 

ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gcc 153 6 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 

aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 15 84 
Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 520 525 

cct gta aca eta acc att aca eta aac ggt aca cag gaa aca gga gac 1632 
Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
530 535 540 

aca act cca agt gca tac tct atg tea ttt tea tgg gac tgg tct ggc 1680 
Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

cac aac tac att aat gaa ata ttt gcc aca tec tct tac act ttt tea 1728 
His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 
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565 570 575 

tac att gcc caa gaa taa 1746 
Tyr He Ala Gin Glu * 
580 



<210> 14 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 



<400> 14 



Met 


Lys 


Arg Ala 


Arg 


Pro 


Ser 


Glu Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 






5 










10 










15 




Tyr 


Asp 


Thr Glu 
20 


Thr 


Gly 


Pro 


Pro 


Thr 
25 


Val 


Pro 


Phe 


Leu 


Thr 
30 


Pro 


Pro 


Phe 


Val 


Ser Pro 


Asn 


Gly 


Phe 


Gin 


Glu 


Ser 


Pro 


Pro 


Gly Val 


Leu 


Ser 






35 








40 










45 








Leu Arg 


Leu Ser 


Glu 


Pro 


Leu 


Val 


Thr 


Ser 


Asn 


Gly Met 


Leu 


Ala 


Leu 




50 








55 










60 










Lys 


Met 


Gly Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala 


Gly Asn 


Leu 


Thr 


Ser 


65 








70 










75 










80 


Gin 


Asn 


Val Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu Gly 


Ala 


Gly Ala 


Ser 


Asn 








85 










90 










95 




He 


Asn 


Leu Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr 


Val 


Thr 


Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala 


Gly Asn 


Thr 


Leu 


Thr 






115 








120 










125 








Met 


Gin 


Ser Gin 


Ala 


Pro 


Leu 


Thr 


Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 




130 








135 








140 








Ala 


Thr 


Gin Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 


145 








150 










155 








160 


Thr 


Ser 


Gly Pro 


Leu 
165 


Thr 


Thr 


Thr 


Asp 


Ser 
170 


Ser 


Thr 


Leu 


Thr 


He 
175 


Thr 


Ala 


Ser 


Pro Pro 


Leu 


Thr 


Thr 


Ala 


Thr Gly 


Ser 


Leu 


Gly 


He 


Asp 


Leu 






180 










185 










190 




Lys 


Glu 


Pro He 


Tyr 


Thr 


Gin 


Asn Gly Lys 


Leu 


Gly Leu Lys 


Tyr 


Gly 






195 








200 










205 




Ala 


Pro 
210 


Leu His 


Val 


Thr 


Asp 
215 


Asp 


Leu 


Asn 


Thr 


Leu 

220 


Thr 


Val 


Ala 


Thr 


Gly Pro 


Gly Val 


Thr 


lie 


Asn 


Asn 


Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 


225 








230 










235 








240 


Gly Ala 


Leu Gly 


Phe 


Asp 


Ser 


Gin Gly Asn Met 


Gin 


Leu 


Asn 


Val 


Ala 








245 










250 










255 




Gly Gly 


Leu Arg 


He 


Asp 


Ser 


Gin 


Asn Arg Arg 


Leu 


He 


Leu 


Asp 


Val 






260 










265 










270 




Ser 


Tyr 


Pro Phe 


Asp 


Ala 


Gin 


Asn 


Gin 


Leu 


Asn 


Leu 


Arg 


Leu Gly Gin 






275 








280 










285 








Gly 


Pro 


Leu Phe 


He 


Asn 


Ser 


Ala 


His 


Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 




290 








295 










300 








Lys Gly 


Leu Tyr 


Leu 


Phe 


Thr 


Ala 


Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 


305 








310 










315 






320 


Val 


Asn 


Leu Ser 


Thr 


Ala 


Lys 


Gly 


Leu 


Met 


Phe 


Asp Ala Thr Ala 


He 








325 










330 










335 




Ala 


lie 


Asn Ala 


Gly 


Asp 


Gly 


Leu 


Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 






340 










345 








350 






Asn 


Thr 


Asn Pro 


Leu 


Lys 


Thr 


Lys 


He Gly His 


Gly Leu Glu 


Phe 


Asp 
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355 










360 


Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 




370 










375 




Ser 


Thr 


Gly 


Ala 


He 


Thr 


Val 


Gly 


385 










390 






Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Glu 










405 








Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 








420 










Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 






435 










440 


Ser 


Gly 


Thr 


Val 


Gin 


Ser 


Ala 


His 




450 










455 




Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


465 










470 






Arg 


Asn 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly 










485 








Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 








500 










Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 




515 










520 


Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 




530 










535 




Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


545 










550 






His 


Asn 


Tyr 


He 


Asn 


Glu 


He 


Phe 








565 








Tyr 


He 


Ala 


Gin 


Glu 









580 



365 



Leu 


Gly Thr Gly Leu 


Ser 


Phe 


Asp 








380 










Asn 


Lys 


Asn Asn Asp 


Lys 


Leu 


Thr 






395 










400 


Ala 


Asn 


Cys 


Arg 


Leu 


Asn 


Ala 


Glu 




410 










415 




Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 


425 










430 






Val 


Lys 


Gly 


Ser 


Leu 


Ala 


Pro 


He 










445 








Leu 


He 


He 


Arg 


Phe 


Asp 


Glu 


Asn 








460 










Leu 


Asp 


Pro 


Glu 


Tyr 


Trp 


Asn 


Phe 






475 










480 


Thr 


Ala 


Tyr 


Thr 


Asn 


Ala 


Val 


Gly 




490 










495 




Pro 


Lys 


Ser 


His 


Gly 


Lys 


Thr 


Ala 


505 










510 






Tyr 


Leu Asn Gly Asp 


Lys 


Thr 


Lys 










525 








Asn 


Gly Thr Gin Glu Thr 


Gly 


Asp 








540 










Ser 


Phe 


Ser 


Trp 


Asp 


Trp 


Ser 


Gly 






555 










560 


Ala 


Thr 


Ser 


Ser 


Tyr 


Thr 


Phe 


Ser 




570 










575 





<210> 15 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F K01S*RGD 

<221> CDS 

<222> (1) . . . (1776) 

<400> 15 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 48 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 25 30 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 ~ 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 



aaa atg ggc aac ggc etc tct ctg gac gag gec ggc aac ctt acc tec 24 0 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
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65 70 75 80 

caa aat gta acc act gtg age cca cct etc gga gec gga gec tea aac 2 88 
Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 
85 90 95 

ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gec eta 336 
lie Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 

act gtg get gec gec gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 
Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gec ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 
130 135 140 

gec acc caa gga ccc etc aca gtg tea gaa gga aag eta gec ctg caa 48 0 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 52 8 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 
165 170 175 

gee tea ccc cct eta act act gee act ggt age ttg ggc att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 
180 185 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 200 205 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 76 8 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

99 a 99 a ct a agg att gat tct caa aac aga cgc ctt ata ctt gat gtt 816 
Gly Gly Leu Arg lie Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gec cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
'290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 
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gtt aac eta age act gec aag ggg ttg atg ttt gac get aca gec ata 10 0 8 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala lie 
325 330 335 

gec att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 10 56 
Ala lie Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 ^ 350 

aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 
355 360 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 

age aca ggt gee att aca gta gga aac aaa aat aat gat aag eta act 12 0 0 
Ser Thr Gly Ala lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

ttg tgg ace aca cca get cca gag get aac tgt aga eta aat gca gag 124 8 
Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 
405 410 ~ 415 

aaa gat get aaa etc act ttg gtc tta aca aaa tgt ggc agt caa ata 1296 
Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie 
420 425 ~ ~ 430 

ctt get aca gtt tea gtt ttg get gtt aaa ggc agt ttg get cca ata 1344 
Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro lie 
435 440 445 

tct gga aca gtt caa agt get cat ctt att ata aga ttt gac gaa aat 1392 
Ser Gly Thr Val Gin Ser Ala His Leu lie lie Arg Phe Asp Glu Asn 
450 455 460 

gga gtg eta eta aac aat tec ttc ctg gac cca gaa tat tgg aac ttt 144 0 
Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

aga aat gga gat ctt act gaa ggc aca gee tat aca aac get gtt gga 14 88 
Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 
485 490 495 

ttt atg cct aac eta tea get tat cca aaa tct cac ggt aaa act gee 153 6 
Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 

aaa agt aac att gtc agt caa gtt tac tta aac gga gac aaa act aaa 15 84 
Lys Ser Asn lie Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 
515 520 525 

cct gta aca eta acc att aca eta aac ggt aca cag gaa aca ggt gat 1632 
Pro Val Thr Leu Thr lie Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
530 535 ' 540 

cat tgt gat tgt cgt ggt gat tgt ttt tgt aca act cca agt gca tac 1680 
His Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Thr Pro Ser Ala Tyr 
545 550 ~ 555 560 
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tct atg tea ttt tea tgg gac tgg tct ggc cac aac tac att aat gaa 1728 
Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu 
565 570 575 

ata ttt gec aca tec tct tac act ttt tea tac att gec caa gaa taa 1776 
He Phe Ala Thr Ser Ser Tyr Thr Phe Ser Tyr He Ala Gin Glu * 
580 585 590 



<210> 16 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F K01S*RGD 



<400> 16 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


1 








5 








Tyr Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 








20 










Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 






35 








40 


Leu Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 




50 










55 




Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


65 










70 






Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 










85 








He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 








100 










Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 






115 










120 


Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 




130 










135 




Ala 


Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


145 










150 






Thr 


Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 








165 








Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 








180 










Lys 


Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 






195 










200 


Ala 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 




210 










215 




Gly 


Pro 


Gly 


Val 


Thr 


He 


Asn 


Asn 


225 










230 






Gly Ala 


Leu 


Gly 


Phe 


Asp 


Ser 


Gin 










245 








Gly 


Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 








260 










Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 






275 










280 


Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 


290 










295 




Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


305 










310 






Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 



325 



Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


10 










15 




Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


25 










30 






Glu 


Ser 


Pro 


Pro 


Gly Val 


Leu 


Ser 










45 








Thr 


Ser 


Asn 


Gly Met 


Leu 


Ala 


Leu 








60 










Asp 


Glu 


Ala 


Gly Asn Leu Thr 


Ser 




75 










80 


Pro 


Leu 


Gly 


Ala Gly Ala 


Ser 


Asn 




90 










95 




Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


105 










110 






Met 


Val 


Ala 


Gly Asn Thr 


Leu 


Thr 










125 








Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 








140 










Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 






155 










160 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 


170 










175 




Thr 


Gly 


Ser 


Leu Gly 


He 


Asp 


Leu 


185 










190 






Gly 


Lys 


Leu 


Gly Leu 


Lys 


Tyr 


Gly 










205 








Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 








220 










Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 






235 










240 


Gly 


Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 




250 










255 




Asn 


Arg 


Arg 


Leu 


He 


Leu 


Asp 


Val 


265 










270 






Gin 


Leu 


Asn 


Leu 


Arg 


Leu 


Gly 


Gin 










285 








His 


Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 








300 










Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 






315 










320 


Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 




330 








335 
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Ala 


lie 


Asn 


Ala 








340 


Asn 


Thr 


Asn 


Pro 






355 




Ser 


Asn 


Lys 


7\ *1 = 




370 






Ser 


Thr 


Gly 


Ala 


385 








Leu 


Trp 


Thr 


Thr 


Lys 


Asp 


Ala 


Lys 








420 


Leu 


Ala 


Thr 


Val 






435 




Ser 


Gly 


Thr 


Val 




450 






Gly 


Val 


Leu 


Leu 


465 








Arg 


Asn 


Gly 


Asp 


Phe 


Met 


Pro 


Asn 








500 


l y s 


Ser 


Asn 


He 






515 




Pro 


Val 


Thr 


Leu 




530 






His 


Cys 


Asp 


Cys 


545 








Ser 


Met 


Ser 


Phe 


lie 


Phe 


Ala 


Thr 








580 



Gly 


Asp 


Gly 


Leu 


Leu 


Lys 


Thr 


Lys 








360 


Met 


Val 


Pro 


Lys 






375 




He 


Thr 


tt_ n 

Val 


GJLy 




390 






Pro 


Ala 


Pro 


Glu 


405 








Leu 


Thr 


Leu 


Val 


Ser 


Val 


Leu 


Ala 








440 


Gin 


Ser 


Ala 


His 






455 




Asn 


Asn 


Ser 


Phe 




470 






Leu 


Thr 


Glu 


Gly 


485 








Leu 


Ser 


Ala 


Tyr 


Val 


Ser 


Gin 


Val 








520 


Thr 


He 


Thr 


Leu 






535 




Arg 


Gly 


Asp 


Cys 




550 






Ser 


Trp 


Asp 


Trp 


565 








Ser 


Ser 


Tyr 


Thr 



Glu 


Phe 


Gly 


Ser 


345 








He 


Gly 


His 


Gly 


Leu 


Gly 


Thr 


Gly 








380 


Asn 


Lys 


Asn 


Asn 






395 




/\j_a 


Asn 


Cys 


Arg 




410 






Leu 


Thr 


Lys 


Cys 


425 








Val 


Lys 


Gly 


Ser 


Leu 


He 


He 


Arg 








460 


Leu 


Asp 


Pro 


Glu 






475 




Thr 


Ala 


Tyr 


Thr 




490 






Pro 


Lys 


Ser 


His 


505 








Tyr 


Leu 


Asn 


Gly 


Asn 


Gly 


Thr 


Gin 








540 


Phe 


Cys 


Thr 


Thr 






555 




Ser 


Gly 


His 


Asn 




570 






Phe 


Ser 


Tyr 


He 



585 



Pro Asn Ala Pro 
350 

Leu Glu Phe Asp 
365 

Leu Ser Phe Asp 

Asp Lys Leu Thr 
400 

Leu Asn Ala Glu 
415 

Gly Ser Gin He 
430 

Leu Ala Pro He 
445 

Phe Asp Glu Asn 

Tyr Trp Asn Phe 
480 

Asn Ala Val Gly 
495 

Gly Lys Thr Ala 
510 

Asp Lys Thr Lys 
525 

Glu Thr Gly Asp 

Pro Ser Ala Tyr 
560 

Tyr He Asn Glu 
575 

Ala Gin Glu 
590 



<210> 17 
<211> 972 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad35 fiber 

<221> CDS 

<222> (1) . . . (972) 

<400> 17 

atg acc aag aga gtc egg etc agt gac tec ttc aac cct gtc tac ccc 48 
Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 
1 5 10 15 

tat gaa gat gaa age acc tec caa cac ccc ttt ata aac cca ggg ttt 96 
Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe He Asn Pro Gly Phe 
20 25 30 

att tec cca aat ggc ttc aca caa age cca gac gga gtt ctt act tta 144 
He Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 40 45 



aaa tgt tta acc cca eta aca acc aca ggc gga tct eta cag eta aaa 192 
Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 
50 55 ~ 60 
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9^-9 99^ 999 99& ctt aca gtg gat gac act gat ggt acc tta caa gaa 24 0 
Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
65 70 75 80 

aac ata cgt get aca gca ccc att act aaa aat aat cac tct gta gaa 2 88 
Asn He Arg Ala Thr Ala Pro He Thr Lys Asn Asn His Ser Val Glu 
85 90 95 

eta tec att gga aat gga tta gaa act caa aac aat aaa eta tgt gee 33 6 
Leu Ser He Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 
100 105 110 

aaa ttg gga aat ggg tta aaa ttt aac aac ggt gac att tgt ata aag 3 84 
Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp He Cys He Lys 
115 120 125 

gat agt att aac acc tta tgg act gga ata aac cct cca cct aac tgt 432 
Asp Ser He Asn Thr Leu Trp Thr Gly He Asn Pro Pro Pro Asn Cys 
130 135 140 

caa att gtg gaa aac act aat aca aat gat ggc aaa ctt act tta gta 48 0 
Gin He Val Glu Asn Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val 
145 150 ~ 155 160 

tta gta aaa aat gga ggg ctt gtt aat ggc tac gtg tct eta gtt ggt 52 8 
Leu Val Lys Asn Gly Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly 
165 170 175 

gta tea gac act gtg aac caa atg ttc aca caa aag aca gca aac ate 576 
Val Ser Asp Thr Val Asn Gin Met Phe Thr Gin Lys Thr Ala Asn He 
180 185 190 

caa tta aga tta tat ttt gac tct tct gga aat eta tta act gag gaa 624 
Gin Leu Arg Leu Tyr Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu 
195 200 205 

tea gac tta aaa att cca ctt aaa aat aaa tct tct aca gcg acc agt 672 
Ser Asp Leu Lys He Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser 
210 215 220 

gaa act gta gee age age aaa gee ttt atg cca agt act aca get tat 72 0 
Glu Thr Val Ala Ser Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr 
225 230 235 240 

ccc ttc aac acc act act agg gat agt gaa aac tac att cat gga ata 768 
Pro Phe Asn Thr Thr Thr Arg Asp Ser Glu Asn Tyr He His Gly He 
245 250 255 

tgt tac tac atg act agt tat gat aga agt eta ttt ccc ttg aac att 816 
Cys Tyr Tyr Met Thr Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn He 
260 265 270 

tct ata atg eta aac age cgt atg att tct tec aat gtt gee tat gee 864 
Ser He Met Leu Asn Ser Arg Met He Ser Ser Asn Val Ala Tyr Ala 
275 280 285 

ata caa ttt gaa tgg aat eta aat gca agt gaa tct cca gaa age aac 912 
He Gin Phe Glu Trp Asn Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn 
290 295 300 



WO 2004/099422 



PCT/US2004/009219 



-35- 



ata get acg ctg acc aca tec ccc ttt ttc ttt tct tac att aca gaa 960 
lie Ala Thr Leu Thr Thr Ser Pro Phe Phe Phe Ser Tyr lie Thr Glu 
305 310 315 320 

gac gac gaa taa 9 72 

Asp Asp Glu * 



<210> 18 
<211> 323 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad 35 fiber 



<400> 18 



Met 


Thr 


Lys 


Arg 


Val 


Arg 


Leu 


Ser 


1 








5 








Tyr 


Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Gin 








20 










He 


Ser 


Pro 


Asn 


Gly 


Phe 


Thr 


Gin 






35 










40 


Lys 


Cys 


Leu 


Thr 


Pro 


Leu 


Thr 


Thr 




50 










55 




Val 


Gly 


Gly 


Gly 


Leu 


Thr 


Val 


Asp 


65 










70 






Asn 


He 


Arg 


Ala 


Thr 


Ala 


Pro 


He 










85 








Leu 


Ser 


He 


Gly 


Asn 


Gly 


Leu 


Glu 








100 










Lys 


Leu 


Gly 


Asn 


Gly 


Leu 


Lys 


Phe 






115 










120 


Asp 


Ser 


He 


Asn 


Thr 


Leu 


Trp 


Thr 




130 










135 




Gin 


He 


Val 


Glu 


Asn 


Thr 


Asn 


Thr 


145 










150 






Leu 


Val 


Lys 


Asn 


Gly 


Gly 


Leu 


Val 










165 








Val 


Ser 


Asp 


Thr 


Val 


Asn 


Gin 


Met 








180 










Gin 


Leu 


Arg 


Leu 


Tyr 


Phe 


Asp 


Ser 






195 










200 


Ser Asp 


Leu 


Lys 


He 


Pro 


Leu 


Lys 




210 










215 




Glu 


Thr 


Val 


Ala 


Ser 


Ser 


Lys 


Ala 


225 










230 






Pro 


Phe 


Asn 


Thr 


Thr 


Thr 


Arg 


Asp 










245 








Cys 


Tyr 


Tyr 


Met 


Thr 


Ser 


Tyr 


Asp 








260 










Ser 


He 


Met 


Leu 


Asn 


Ser 


Arg 


Met 






275 










280 


He 


Gin 


Phe 


Glu 


Trp 


Asn 


Leu 


Asn 




290 










295 




He 


Ala 


Thr 


Leu 


Thr 


Thr 


Ser 


Pro 


305 










310 






Asp 


Asp 


Glu 













Asp 


Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


10 










15 




His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 


25 










30 






Ser 


Pro 


Asp 


Gly Val 


Leu 


Thr 


Leu 










45 








Thr 


Gly Gly 


Ser 


Leu 


Gin 


Leu 


Lys 








60 










Asp 


Thr 


Asp 
75 


Gly 


Thr 


Leu 


Gin 


Glu 
80 


Thr 


Lys 
90 


Asn 


Asn 


His 


Ser 


Val 
95 


Glu 


Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


C Y S 


Ala 


105 










110 






Asn 


Asn 


Gly 


Asp 


He 
125 


Cys 


He 


Lys 


Gly 


He 


Asn 


Pro 
140 


Pro 


Pro 


Asn 


Cys 


Asn 


Asp 


Gly 
155 


Lys 


Leu 


Thr 


Leu 


Val 
160 


Asn 


Gly 
170 


Tyr 


Val 


Ser 


Leu 


Val 
175 


Gly 


Phe 


Thr 


Gin 


Lys 


Thr 


Ala 


Asn 


He 


185 








190 






Ser 


Gly Asn 


Leu 


Leu 


Thr 


Glu 


Glu 










205 








Asn 


Lys 


Ser 


Ser 


Thr 


Ala 


Thr 


Ser 






220 










Phe 


Met 


Pro 
235 


Ser 


Thr 


Thr 


Ala 


Tyr 
240 


Ser 


Glu 
250 


Asn 


Tyr 


He 


His 


Gly 
255 


He 


Arg 


Ser 


Leu 


Phe 


Pro 


Leu 


Asn 


He 


265 










270 






He 


Ser 


Ser 


Asn 


Val 
285 


Ala 


Tyr 


Ala 


Ala 


Ser 


Glu 


Ser 
300 


Pro 


Glu 


Ser 


Asn 


Phe 


Phe 


Phe 
315 


Ser 


Tyr 


He 


Thr 


Glu 
320 
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<210> 19 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 35F RGD 

<221> CDS 

<222> (1) . . . (1002) 

<400> 19 

atg acc aag aga gtc egg etc agt gac tec ttc aac cct gtc tac ccc 48 
Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gaa gat gaa age acc tec caa cac ccc ttt ata aac cca ggg ttt 96 
Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe lie Asn Pro Gly Phe 
20 25 30 

att tec cca aat ggc ttc aca caa age cca gac gga gtt ctt act tta 144 
lie Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 40 45 

aaa tgt tta acc cca eta aca acc aca ggc gga tct eta cag eta aaa 192 
Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 
50 55 60 

gtg gga ggg gga ctt aca gtg gat gac act gat ggt acc tta caa gaa 24 0 
Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
65 70 75 80 

aac ata cgt get aca gca ccc att act aaa aat aat cac tct gta gaa 28 8 
Asn lie Arg Ala Thr Ala Pro lie Thr Lys Asn Asn His Ser Val Glu 
85 90 95 

eta tec att gga aat gga tta gaa act caa aac aat aaa eta tgt gee 336 
Leu Ser lie Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 
100 105 110 

aaa ttg gga aat ggg tta aaa ttt aac aac ggt gac att tgt ata aag 3 84 
Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp lie Cys lie Lys 
115 ^ ^ 120 ~ 125 

gat agt att aac acc tta tgg act gga ata aac cct cca cct aac tgt 432 
Asp Ser lie Asn Thr Leu Trp Thr Gly lie Asn Pro Pro Pro Asn Cys 
130 135 140 

caa att gtg gaa aac act aat aca aat gat ggc aaa ctt act tta gta 48 0 
Gin lie Val Glu Asn Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val 
145 150 155 160 

tta gta aaa aat gga ggg ctt gtt aat ggc tac gtg tct eta gtt ggt 52 8 
Leu Val Lys Asn Gly Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly 
165 170 175 

gta tea gac act gtg aac caa atg ttc aca caa aag aca gca aac ate 576 
Val Ser Asp Thr Val Asn Gin Met Phe Thr Gin Lys Thr Ala Asn He 
180 185 190 



caa tta aga tta tat ttt gac tct tct gga aat eta tta act gag gaa 



624 
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Gln Leu Arg Leu Tyr Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu 
195 ^ 200 205 

tea gac tta aaa att cca ctt aaa aat aaa tct tct aca gcg acc agt 
Ser Asp Leu Lys He Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser 
210 215 220 

gaa act gta gec age age aaa gec ttt atg cca agt act aca get tat 
Glu Thr Val Ala Ser Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr 
225 230 235 240 



tct ata atg eta aac age cgt atg att tct tec aat gta cat tgt gat 
Ser He Met Leu Asn Ser Arg Met He Ser Ser Asn Val His Cys Asp 
275 280 285 



672 



720 



ccc ttc aac acc act act agg gat agt gaa aac tac att cat gga ata 768 
Pro Phe Asn Thr Thr Thr Arg Asp Ser Glu Asn Tyr He His Gly He 
245 250 255 

tgt tac tac atg act agt tat gat aga agt eta ttt ccc ttg aac att 816 
Cys Tyr Tyr Met Thr Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn He 
260 265 270 



864 



tgt cgt ggt gat tgt ttt tgc gca tat gee ata caa ttt gaa tgg aat 912 
Cys Arg Gly Asp Cys Phe Cys Ala Tyr Ala He Gin Phe Glu Trp Asn 
290 " ^ 295 300 

eta aat gca agt gaa tct cca gaa age aac ata get acg ctg acc aca 960 
Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn He Ala Thr Leu Thr Thr 
305 310 315 320 

tec ccc ttt ttc ttt tct tac att aca gaa gac gac gaa taa 1002 
Ser Pro Phe Phe Phe Ser Tyr He Thr Glu Asp Asp Glu * 
325 330 



<210> 20 
<211> 332 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 3 5FRGD 



<400> 20 



Thr 


Lys 


Arg 


Val Arg 


Leu 


Ser 


Asp 


Ser Phe Asn 


Pro 


Val 


Tyr 


Pro 


Tyr 


1 


5 








10 








15 




Glu 


Asp 


Glu 


Ser Thr 


Ser 


Gin 


His 


Pro Phe He 


Asn 


Pro 


Gly 


Phe 


He 






20 








25 






30 






Ser 


Pro 


Asn 


Gly Phe 


Thr 


Gin 


Ser 


Pro Asp Gly Val 


Leu 


Thr 


Leu 


Lys 






35 






40 






45 








Cys 


Leu 


Thr 


Pro Leu 


Thr 


Thr 


Thr 


Gly Gly Ser 


Leu 


Gin 


Leu 


Lys 


Val 


50 








55 






60 










Gly Gly 


Gly 


Leu Thr 


Val 


Asp 


Asp 


Thr Asp Gly 


Thr 


Leu 


Gin 


Glu 


Asn 


65 






70 






75 










80 


He 


Arg 


Ala 


Thr Ala 


Pro 


He 


Thr 


Lys Asn Asn 


His 


Ser 


Val 


Glu 


Leu 






85 








90 








95 




Ser 


He 


Gly 


Asn Gly 


Leu 


Glu 


Thr 


Gin Asn Asn 


Lys 


Leu 


Cys 


Ala 


Lys 






100 








105 






110 






Leu 


Gly 


Asn 


Gly Leu 


Lys 


Phe Asn Asn Gly Asp 


He 


Cys 


He 


Lys 


Asp 
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115 




Ser 


He 


Asn 


Thr 




130 






He 


Val 


Glu 


Asn 


145 








Val 


Lys 


Asn 


Gly 


Ser Asp 


Thr 


Val 








180 


Leu 


Arg 


Leu 


Tyr 






195 




Asp 


Leu 


Lys 


He 




210 






Thr 


Val 


Ala 


Ser 


225 








Phe 


Asn 


Thr 


Thr 


T Y r 


Tyr 


Met 


Thr 








260 


He 


Met 


Leu 


Asn 






275 




Arg Gly 


Asp 


Cys 




290 






As n 


Ala 


Ser 


Glu 


305 








Pro 


Phe 


Phe 


Phe 



120 

Leu Trp Thr Gly 
135 

Thr Asn Thr Asn 
150 

Gly Leu Val Asn 
165 

Asn Gin Met Phe 

Phe Asp Ser Ser 
200 

Pro Leu Lys Asn 
215 

Ser Lys Ala Phe 
230 

Thr Arg Asp Ser 
245 

Ser Tyr Asp Arg 

Ser Arg Met He 
280 

Phe Cys Ala Tyr 
2 95 

Ser Pro Glu Ser 
310 

Ser Tyr He Thr 
325 



He Asn 


Pro 


Pro 






140 


Asp Gly 


Lys 


Leu 




155 




Gly Tyr 


Val 


Ser 


170 






Thr Gin 


Lys 


Thr 


185 




Gly Asn 


Leu 


Leu 


Lys Ser 


Ser 


Thr 






220 


Met Pro 


Ser 


Thr 




235 




Glu Asn 


Tyr 


He 


250 






Ser Leu 


Phe 


Pro 


265 






Ser Ser 


Asn 


Val 


Ala He 


Gin 


Phe 






300 


Asn He 


Ala 


Thr 




315 




Glu Asp 


Asp 


Glu 


330 







125 

Pro Asn Cys Gin 

Thr Leu Val Leu 
160 

Leu Val Gly Val 
175 

Ala Asn He Gin 
190 

Thr Glu Glu Ser 
205 

Ala Thr Ser Glu 

Thr Ala Tyr Pro 
240 

His Gly He Cys 
255 

Leu Asn He Ser 
270 

His Cys Asp Cys 
285 

Glu Trp Asn Leu 

Leu Thr Thr Ser 
320 



<210> 21 
<211> 1164 
<212> DNA ( 

<213> Artificial Sequence 
<220> 

<223> Ad41 short fiber 

<221> CDS 

<222> (1) . . . (1164) 

<400> 21 

atg aaa aga acc aga att gaa gac gac ttc aac ccc gtc tac ccc tat 4 8 

Met Lys Arg Thr Arg He Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 

15 10 15 

gac acc ttc tea act ccc age ate ccc tat gta get ccg ccc ttc gtt 96 
Asp Thr Phe Ser Thr Pro Ser He Pro Tyr Val Ala Pro Pro Phe Val 
20 25 30 

tct tct gac ggg tta cag gaa aaa ccc cca gga gtt tta gca etc aag 144 
Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro Gly Val Leu Ala Leu Lys 
35 40 45 

tac act gac ccc att act acc aat get aag cat gag ctt act tta aaa 192 
Tyr Thr Asp Pro He Thr Thr Asn Ala Lys His Glu Leu Thr Leu Lys 
50 55 60 

ctt gga age aac at a act tta gaa aat ggg tta ctt teg gee aca gtt 240 
Leu Gly Ser Asn He Thr Leu Glu Asn Gly Leu Leu Ser Ala Thr Val 
65 70 " 75 80 
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ccc act gtt tct cct ccc ctt aca aac agt aac aac tec ctg ggt tta 2 88 
Pro Thr Val Ser Pro Pro Leu Thr Asn Ser Asn Asn Ser Leu Gly Leu 
85 90 95 

gec aca tec get ccc ata get gta tea get aac tct etc aca ttg gee 336 
Ala Thr Ser Ala Pro lie Ala Val Ser Ala Asn Ser Leu Thr Leu Ala 
100 105 110 

ace gee gca cca ctg aca gta age aac aac cag ctt agt att aac gcg 3 84 
Thr Ala Ala Pro Leu Thr Val Ser Asn Asn Gin Leu Ser lie Asn Ala 
115 120 125 

ggc aga ggt tta gtt ata act aac aat gee tta aca gtt aat cct ace 432 
Gly Arg Gly Leu Val lie Thr Asn Asn Ala Leu Thr Val Asn Pro Thr 
130 135 140 

gga gcg eta ggt ttc aat aac aca gga get tta caa tta aat get gca 480 
Gly Ala Leu Gly Phe Asn Asn Thr Gly Ala Leu Gin Leu Asn Ala Ala 
145 150 155 160 

gga gga atg aga gtg gac ggt gee aac tta att ctt cat gta gca tat 52 8 
Gly Gly Met Arg Val Asp Gly Ala Asn Leu lie Leu His Val Ala Tyr 
165 170 175 

ccc ttt gaa gca ate aac cag eta aca ctg cga tta gaa aac ggg tta 576 
Pro Phe Glu Ala lie Asn Gin Leu Thr Leu Arg Leu Glu Asn Gly Leu 
180 185 190 

gaa gta ace age gga gga aag ctt aac gtt aag ttg gga tea ggc etc 624 
Glu Val Thr Ser Gly Gly Lys Leu Asn Val Lys Leu Gly Ser Gly Leu 
195 200 ~ 205 

caa ttt gac agt aac gga cgc att get att agt aat age aac cga act 672 
Gin Phe Asp Ser Asn Gly Arg lie Ala lie Ser Asn Ser Asn Arg Thr 
210 215 220 

cga agt gta cca tec etc act ace att tgg tct ate teg cct acg cct 72 0 
Arg Ser Val Pro Ser Leu Thr Thr lie Trp Ser He Ser Pro Thr Pro 
225 230 235 240 

aac tgc tec att tat gaa ace caa gat gca aac eta ttt ctt tgt eta 768 
Asn Cys Ser He Tyr Glu Thr Gin Asp Ala Asn Leu Phe Leu Cys Leu 
245 250 255 

act aaa aac gga get cac gta tta ggt act ata aca ate aaa ggt ctt 816 
Thr Lys Asn Gly Ala His Val Leu Gly Thr He Thr He Lys Gly Leu 
260 265 270 

aaa gga gca ctg egg gaa atg cac gat aac get eta tct tta aaa ctt 8 64 
Lys Gly Ala Leu Arg Glu Met His Asp Asn Ala Leu Ser Leu Lys Leu 
275 280 285 

ccc ttt gac aat cag gga aat tta ctt aac tgt gee ttg gaa tea tec 912 
Pro Phe Asp Asn Gin Gly Asn Leu Leu Asn Cys Ala Leu Glu Ser Ser 
290 295 300 

acc tgg cgt tac cag gaa ace aac gca gtg gee tct aat gee tta aca 960 
Thr Trp Arg Tyr Gin Glu Thr Asn Ala Val Ala Ser Asn Ala Leu Thr 
305 310 315 320 

ttt atg ccc aac agt aca gtg tat cca cga aac aaa acc get cac ccg 1008 
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Phe Met Pro Asn Ser Thr 
325 

ggc aac atg etc ate caa 
Gly Asn Met Leu lie Gin 
340 

tac aac gag ata aac agt 
Tyr Asn Glu lie Asn Ser 
355 

gaa ccg gga aaa cct ttt 
Glu Pro Gly Lys Pro Phe 
370 

act gaa gaa taa 
Thr Glu Glu * 
385 



Val Tyr Pro Arg Asn Lys 
330 

ate teg cct aac ate acc 
lie Ser Pro Asn lie Thr 
345 

ggg tat get ttt act ttt 
Gly Tyr Ala Phe Thr Phe 
360 

cac cca cct acc get gta 
His Pro Pro Thr Ala Val 
375 380 



Thr Ala His Pro 
335 

ttc agt gtc gtc 1056 
Phe Ser Val Val 
350 

aaa tgg tea gee 1104 

Lys Trp Ser Ala 

365 

ttt tgc tac ata 1152 
Phe Cys Tyr lie 



1164 



<210> 22 

<211> 387 

<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Ad41 short fiber 



<400> 22 



Met 


Lys 


Arg 


Thr 


Arg 


He 


Glu Asp Asp Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


Tyr 


1 








5 






10 










15 




Asp 


Thr 


Phe 


Ser 


Thr 


Pro 


Ser 


He Pro Tyr 


Val 


Ala 


Pro 


Pro 


Phe 


Val 






20 








25 








30 






Ser 


Ser 


Asp 


Gly 


Leu 


Gin 


Glu 


Lys Pro Pro 


Gly Val 


Leu 


Ala 


Leu 


Lys 






35 










40 






45 








Tyr 


Thr 


Asp 


Pro 


He 


Thr 


Thr 


Asn Ala Lys 


His 


Glu 


Leu 


Thr 


Leu 


Lys 


50 








55 






60 










Leu 


Gly 


Ser 


Asn 


He 


Thr 


Leu 


Glu Asn Gly 


Leu 


Leu 


Ser 


Ala 


Thr 


Val 


65 








70 






75 










80 


Pro 


Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Thr Asn Ser 


Asn 


Asn 


Ser 


Leu 


Gly 


Leu 










85 






90 










95 




Ala 


Thr 


Ser 


Ala 


Pro 


He 


Ala 


Val Ser Ala 


Asn 


Ser 


Leu 


Thr 


Leu 


Ala 








100 








105 








110 






Thr 


Ala 


Ala 


Pro 


Leu 


Thr 


Val 


Ser Asn Asn 


Gin 


Leu 


Ser 


He 


Asn 


Ala 






115 










120 






125 








Gly Arg 


Gly 


Leu 


Val 


He 


Thr 


Asn Asn Ala 


Leu 


Thr 


Val 


Asn 


Pro 


Thr 




130 










135 






140 










Gly Ala 


Leu 


Gly 


Phe 


Asn 


Asn Thr Gly Ala 


Leu 


Gin 


Leu 


Asn 


Ala 


Ala 


145 








150 






155 










160 


Gly 


Gly 


Met 


Arg 


Val 


Asp 


Gly Ala Asn Leu 


He 


Leu 


His 


Val 


Ala 


Tyr 










165 






170 










175 




Pro 


Phe 


Glu 


Ala 


He 


Asn 


Gin 


Leu Thr Leu 


Arg 


Leu 


Glu 


Asn 


Gly 


Leu 








180 








185 








190 






Glu 


Val 


Thr 


Ser 


Gly 


Gly 


Lys 


Leu Asn Val 


Lys 


Leu 


Gly 


Ser 


Gly 


Leu 






195 










200 






205 








Gin 


Phe 


Asp 


Ser 


Asn 


Gly 


Arg 


He Ala He 


Ser 


Asn 


Ser 


Asn 


Arg 


Thr 




210 








215 






220 










Arg 


Ser 


Val 


Pro 


Ser 


Leu 


Thr 


Thr He Trp 


Ser 


He 


Ser 


Pro 


Thr 


Pro 


225 










230 






235 










240 


Asn 


Cys 


Ser 


He 


Tyr 


Glu 


Thr 


Gin Asp Ala Asn 


Leu 


Phe 


Leu 




Leu 



245 250 255 
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Thr 


Lys 


Asn 


Gly 








260 


Lys 


Gly 


Ala 


Leu 






275 




Pro 


Phe 


Asp 


Asn 




290 






Thr 


Trp 


Arg 


Tyr 


305 








Phe 


Met 


Pro 


Asn 


Gly 


Asn 


Met 


Leu 








340 


Tyr 


Asn 


Glu 


lie 






355 




Glu 


Pro 


Gly 


Lys 




370 






Thr 


Glu 


Glu 




385 









Ala His Val Leu 

Arg Glu Met His 
280 

Gin Gly Asn Leu 
295 

Gin Glu Thr Asn 
310 

Ser Thr Val Tyr 
325 

lie Gin lie Ser 

Asn Ser Gly Tyr 
360 

Pro Phe His Pro 
375 



Gly Thr lie Thr 
265 

Asp Asn Ala Leu 

Leu Asn Cys Ala 
300 

Ala Val Ala Ser 
315 

Pro Arg Asn Lys 
33 0 

Pro Asn lie Thr 
345 

Ala Phe Thr Phe 

Pro Thr Ala Val 
380 



He 


Lys 


Gly 


Leu 




270 






Ser 


Leu 


Lys 


Leu 


285 








Leu 


Glu 


Ser 


Ser 


Asn 


Ala 


Leu 


Thr 








320 


Thr 


Ala 


His 


Pro 






335 




Phe 


Ser 


Val 


Val 




350 






Lys 


Trp 


Ser 


Ala 


365 








Phe 


Cys 


Tyr 


He 



<210> 23 
<211> 1194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41sF RGD 

<221> CDS 

<222> (1) . . . (1194) 

<400> 23 

atg aaa aga acc aga att gaa gac gac ttc aac ccc gtc tac ccc tat 48 

Met Lys Arg Thr Arg lie Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 

15 10 " 15 

gac acc ttc tea act ccc age ate ccc tat gta get ccg ccc ttc gtt 96 
Asp Thr Phe Ser Thr Pro Ser He Pro Tyr Val Ala Pro Pro Phe Val 
20 25 ~ 30 

tct tct gac ggg tta cag gaa aaa ccc cca gga gtt tta gca etc aag 144 
Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro Gly Val Leu Ala Leu Lys 
35 40 " 45 

tac act gac ccc att act acc aat get aag cat gag ctt act tta aaa 192 
Tyr Thr Asp Pro He Thr Thr Asn Ala Lys His Glu Leu Thr Leu Lys 
50 55 60 

ctt gga age aac ata act tta gaa aat ggg tta ctt teg gee aca gtt 24 0 
Leu Gly Ser Asn He Thr Leu Glu Asn Gly Leu Leu Ser Ala Thr Val 
65 70 75 80 

ccc act gtt tct cct ccc ctt aca aac agt aac aac tec ctg ggt tta 288 
Pro Thr Val Ser Pro Pro Leu Thr Asn Ser Asn Asn Ser Leu Gly Leu 
85 90 95 

gee aca tec get ccc ata get gta tea get aac tct etc aca ttg gee 336 
Ala Thr Ser Ala Pro He Ala Val Ser Ala Asn Ser Leu Thr Leu Ala 
100 105 110 



acc gee gca cca ctg aca gta age aac aac cag ctt agt att aac gcg 



384 
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Thr Ala Ala Pro Leu Thr Val Ser Asn Asn Gin Leu Ser lie Asn Ala 
115 120 125 

ggc aga ggt tta gtt ata act aac aat gcc tta aca gtt aat cct acc 432 
Gly Arg Gly Leu Val lie Thr Asn Asn Ala Leu Thr Val Asn Pro Thr 
130 135 140 

gga gcg eta ggt ttc aat aac aca gga get tta caa tta aat get gca 480 
Gly Ala Leu Gly Phe Asn Asn Thr Gly Ala Leu Gin Leu Asn Ala Ala 
145 150 155 160 

gga gga atg aga gtg gac ggt gcc aac tta att ctt cat gta gca tat 52 8 
Gly Gly Met Arg Val Asp Gly Ala Asn Leu lie Leu His Val Ala Tyr 
165 170 175 

ccc ttt gaa gca ate aac cag eta aca ctg cga tta gaa aac ggg tta 576 
Pro Phe Glu Ala lie Asn Gin Leu Thr Leu Arg Leu Glu Asn Gly Leu 
180 185 190 

gaa gta acc age gga gga aag ctt aac gtt aag ttg gga tea ggc etc 624 
Glu Val Thr Ser Gly Gly Lys Leu Asn Val Lys Leu Gly Ser Gly Leu 
195 200 ~ 205 

caa ttt gac agt aac gga cgc att get att agt aat age aac cga act 672 
Gin Phe Asp Ser Asn Gly Arg lie Ala lie Ser Asn Ser Asn Arg Thr 
210 215 220 

cga agt gta cca tec etc act acc att tgg tct ate teg cct acg cct 72 0 
Arg Ser Val Pro Ser Leu Thr Thr lie Trp Ser lie Ser Pro Thr Pro 
225 230 235 ' 240 

aac tgc tec att tat gaa acc caa gat gca aac eta ttt ctt tgt eta 768 
Asn Cys Ser lie Tyr Glu Thr Gin Asp Ala Asn Leu Phe Leu Cys Leu 
245 250 255 

act aaa aac gga get cac gta tta ggt act ata aca ate aaa ggt ctt 816 
Thr Lys Asn Gly Ala His Val Leu Gly Thr He Thr He Lys Gly Leu 
260 265 270 

aaa gga gca ctg egg gaa atg cac gat aac get eta tct tta aaa ctt 864 
Lys Gly Ala Leu Arg Glu Met His Asp Asn Ala Leu Ser Leu Lys Leu 
275 280 285 

ccc ttt gac aat cag gga aat tta ctt aac tgt gcc ttg gaa tea tec 912 
Pro Phe Asp Asn Gin Gly Asn Leu Leu Asn Cys Ala Leu Glu Ser Ser 
290 295 300 

acc tgg cgt tac cag gaa acc aac gca gtg gcc tct aat gcc tta aca 960 
Thr Trp Arg Tyr Gin Glu Thr Asn Ala Val Ala Ser Asn Ala Leu Thr 
305 310 315 320 

ttt atg ccc aac agt aca gtg tat cca cga aac aaa acc get cac ccg 1008 
Phe Met Pro Asn Ser Thr Val Tyr Pro Arg Asn Lys Thr Ala His Pro 
325 330 ^ 335 

ggc aac atg etc ate caa ate teg cct aac ate acc ttc agt gtc gtc 1056 
Gly Asn Met Leu He Gin He Ser Pro Asn He Thr Phe Ser Val Val 
340 345 350 

tac aac gag ata aac tgt gat tgt cgt ggt gat tgt ttt tgt act agt 1104 
Tyr Asn Glu He Asn Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Ser 
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355 360 365 

ggg tat get ttt act ttt aaa tgg tea gec gaa ccg gga aaa cct ttt 1152 
Gly Tyr Ala Phe Thr Phe Lys Trp Ser Ala Glu Pro Gly Lys Pro Phe 
370 375 380 

cac cca cct acc get gta ttt tgc tac ata act gaa gaa taa 1194 
His Pro Pro Thr Ala Val Phe Cys Tyr lie Thr Glu Glu * 
385 390 395 



<210> 24 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 41s FRGD 



<400> 24 



Met 


Lys 


Arg 


Thr 


Arg 


He 


Glu 


Asp 


1 








5 








Asp 


Thr 


Phe 


Ser 


Thr 


Pro 


Ser 


He 








20 










Ser 


Ser 


Asp 


Gly 


Leu 


Gin 


Glu 


Lys 






35 










40 


Tyr 


Thr 


Asp 


Pro 


He 


Thr 


Thr 


Asn 




50 










55 




Leu 


Gly 


Ser 


Asn 


He 


Thr 


Leu 


Glu 


65 










70 






Pro 


Thr 


Val 


Ser 


Pro 


Pro 


Leu 


Thr 










85 








Ala 


Thr 


Ser 


Ala 


Pro 


He 


Ala 


Val 








100 










Thr 


Ala 


Ala 


Pro 


Leu 


Thr 


Val 


Ser 






115 










120 


Gly 


Arg 


Gly 


Leu 


Val 


He 


Thr 


Asn 




130 










135 




Gly 


Ala 


Leu 


Gly 


Phe 


Asn 


Asn 


Thr 


145 










150 






Gly 


Gly 


Met 


Arg 


Val 


Asp 


Gly 


Ala 










165 








Pro 


Phe 


Glu 


Ala 


He 


Asn 


Gin 


Leu 








180 










Glu 


Val 


Thr 


Ser 


Gly 


Gly 


Lys 


Leu 






195 










200 


Gin 


Phe 


Asp 


Ser 


Asn 


Gly 


Arg 


He 




210 










215 




Arg 


Ser 


Val 


Pro 


Ser 


Leu 


Thr 


Thr 


225 










230 






Asn 


Cys 


Ser 


He 


Tyr 


Glu 


Thr 


Gin 










245 








Thr 


Lys 


Asn 


Gly 


Ala 


His 


Val 


Leu 








260 










Lys 


Gly 


Ala 


Leu 


Arg 


Glu 


Met 


His 






275 










280 


Pro 


Phe 


Asp 


Asn 


Gin 


Gly 


Asn 


Leu 




290 










295 




Thr 


Trp 


Arg 


Tyr 


Gin 


Glu 


Thr 


Asn 



305 310 



Asp 


Phe 
10 


Asn 


Pro 


Val 


Tyr 


Pro 
15 


Tyr 


Pro 


Tyr Val 


Ala 


Pro 


Pro 


Phe 


Val 


25 










30 






Pro 


Pro Gly 


Val 


Leu 


Ala 


Leu 


Lys 










45 








Ala 


Lys 


His 


Glu 


Leu 


Thr 


Leu 


Lys 








60 








Asn Gly Leu 


Leu 


Ser 


Ala 


Thr 


Val 






75 










80 


Asn 


Ser 
90 


Asn 


Asn 


Ser 


Leu 


Gly 
95 


Leu 


Ser 


Ala 


Asn 


Ser 


Leu 


Thr 


Leu 


Ala 


105 










110 






Asn 


Asn 


Gin 


Leu 


Ser 
125 


He 


Asn 


Ala 


Asn 


Ala 


Leu 


Thr 
140 


Val 


Asn 


Pro 


Thr 


Gly Ala Leu 


Gin 


Leu 


Asn 


Ala 


Ala 






155 










160 


Asn 


Leu 


He 


Leu 


His 


Val 


Ala 


Tyr 




170 










175 


Thr 


Leu Arg 


Leu 


Glu 


Asn 


Gly 


Leu 


185 










190 






Asn 


Val 


Lys 


Leu 


Gly 
205 


Ser 


Gly 


Leu 


Ala 


lie 


Ser 


Asn 


Ser 


Asn 


Arg 


Thr 








220 








He 


Trp 


Ser 
235 


He 


Ser 


Pro 


Thr 


Pro 
240 


Asp 


Ala 
250 


Asn 


Leu 


Phe 


Leu 


Cys 
255 


Leu 


Gly 


Thr 


He 


Thr 


He 


Lys 


Gly 


Leu 


265 










270 






Asp 


Asn 


Ala 


Leu 


Ser 
285 


Leu 


Lys 


Leu 


Leu 


Asn 


Cys 


Ala 
300 


Leu 


Glu 


Ser 


Ser 


Ala 


Val 


Ala 
315 


Ser 


Asn 


Ala 


Leu 


Thr 
320 
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Phe Met Pro Asn Ser Thr Val Tyr Pro Arg Asn Lys Thr Ala His Pro 

325 330 335 

Gly Asn Met Leu lie Gin lie Ser Pro Asn lie Thr Phe Ser Val Val 

340 345 350 

Tyr Asn Glu lie Asn Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Ser 

355 360 ' 365 

Gly Tyr Ala Phe Thr Phe Lys Trp Ser Ala Glu Pro Gly Lys Pro Phe 

370 375 380 

His Pro Pro Thr Ala Val Phe Cys Tyr He Thr Glu Glu 
385 390 395 

<210> 25 
<211> 1737 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad 5 pent on 

<221> CDS 

<222> (1) . . . (1737) 

<400> 25 

atg egg cgc gcg gcg atg tat gag gaa ggt cct cct ccc tec tac gag 4 8 

Met Arg Arg Ala Ala Met Tyr Glu Glu Gly Pro Pro Pro Ser Tyr Glu 
15 10 15 

agt gtg gtg age gcg gcg cca gtg gcg gcg gcg ctg ggt tct ccc ttc 96 
Ser Val Val Ser Ala Ala Pro Val Ala Ala Ala Leu Gly Ser Pro Phe 
20 25 " 30 

gat get ccc ctg gac ccg ccg ttt gtg cct ccg egg tac ctg egg cct 144 
Asp Ala Pro Leu Asp Pro Pro Phe Val Pro Pro Arg Tyr Leu Arg Pro 
35 40 45 

ace ggg ggg aga aac age ate cgt tac tct gag ttg gca ccc eta ttc 192 
Thr Gly Gly Arg Asn Ser He Arg Tyr Ser Glu Leu Ala Pro Leu Phe 
50 55 60 

gac ace ace cgt gtg tac ctg gtg gac aac aag tea acg gat gtg gca 24 0 
Asp Thr Thr Arg Val Tyr Leu Val Asp Asn Lys Ser Thr Asp Val Ala 
65 70 75 80 

tec ctg aac tac cag aac gac cac age aac ttt ctg acc acg gtc att 2 88 
Ser Leu Asn Tyr Gin Asn Asp His Ser Asn Phe Leu Thr Thr Val He 
85 90 95 

caa aac aat gac tac age ccg ggg gag gca age aca cag acc ate aat 33 6 
Gin Asn Asn Asp Tyr Ser Pro Gly Glu Ala Ser Thr Gin Thr He Asn 
100 105 110 

ctt gac gac egg teg cac tgg ggc ggc gac ctg aaa acc ate ctg cat 3 84 
Leu Asp Asp Arg Ser His Trp Gly Gly Asp Leu Lys Thr He Leu His 
115 120 125 

acc aac atg cca aat gtg aac gag ttc atg ttt acc aat aag ttt aag 432 
Thr Asn Met Pro Asn Val Asn Glu Phe Met Phe Thr Asn Lys Phe Lys 
130 135 140 

gcg egg gtg atg gtg teg cgc ttg cct act aag gac aat cag gtg gag 4 80 
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Ala Arg Val Met Val Ser Arg Leu Pro Thr Lys Asp Asn Gin Val Glu 
145 150 155 160 

ctg aaa tac gag tgg gtg gag ttc acg ctg ccc gag ggc aac tac tec 52 8 

Leu Lys Tyr Glu Trp Val Glu Phe Thr Leu Pro Glu Gly Asn Tyr Ser 
165 170 175 

gag acc atg acc ata gac ctt atg aac aac gcg ate gtg gag cac tac 576 

Glu Thr Met Thr lie Asp Leu Met Asn Asn Ala He Val Glu His Tyr 
180 185 190 

ttg aaa gtg ggc aga cag aac ggg gtt ctg gaa age gac ate ggg gta 624 

Leu Lys Val Gly Arg Gin Asn Gly Val Leu Glu Ser Asp He Gly Val 

195 200 205 

aag ttt gac acc cgc aac ttc aga ctg ggg ttt gac ccc gtc act ggt 672 

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Phe Asp Pro Val Thr Gly 
210 215 220 

ctt gtc atg cct ggg gta tat aca aac gaa gec ttc cat cca gac ate 72 0 

Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp He 
225 230 235 240 

att ttg ctg cca gga tgc ggg gtg gac ttc acc cac age cgc ctg age 768 

He Leu Leu Pro Gly Cys Gly Val Asp Phe Thr His Ser Arg Leu Ser 
245 1 250 255 

aac ttg ttg ggc ate cgc aag egg caa ccc ttc cag gag ggc ttt agg 816 

Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe Arg 
260 * 265 270 

ate acc tac gat gat ctg gag ggt ggt aac att ccc gca ctg ttg gat 864 

He Thr Tyr Asp Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp 

275 280 285 

gtg gac gee tac cag gcg age ttg aaa gat gac acc gaa cag ggc ggg 912 

Val Asp Ala Tyr Gin Ala Ser Leu Lys Asp Asp Thr Glu Gin Gly Gly 
290 295 300 

ggt ggc gca ggc ggc age aac age agt ggc age ggc gcg gaa gag aac 960 

Gly Gly Ala Gly Gly Ser Asn Ser Ser Gly Ser Gly Ala Glu Glu Asn 
305 310 315 320 

tec aac gcg gca gee gcg gca atg cag ccg gtg gag gac atg aac gat 10 0 8 

Ser Asn Ala Ala Ala Ala Ala Met Gin Pro Val Glu Asp Met Asn Asp 
325 330 335 

age cgc ggc tac ccc tac gac gtg ccc gac tac gcg ggc acc age gee 1056 

Ser Arg Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser Ala 
340 345 350 

aca egg get gag gag aag cgc get gag gee gaa gca gcg gee gaa get 1104 

Thr Arg Ala Glu Glu Lys Arg Ala Glu Ala Glu Ala Ala Ala Glu Ala 

355 360 365 

gee gee ccc get gcg caa ccc gag gtc gag aag cct cag aag aaa ccg 1152 

Ala Ala Pro Ala Ala Gin Pro Glu Val Glu Lys Pro Gin Lys Lys Pro 
370 375 380 

gtg ate aaa ccc ctg aca gag gac age aag aaa cgc agt tac aac eta 12 0 0 

Val He Lys Pro Leu Thr Glu Asp Ser Lys Lys Arg Ser Tyr Asn Leu 



WO 2004/099422 



PCT/US2004/009219 



-46- 



385 390 395 400 

ata age aat gac age acc ttc acc cag tac cgc age tgg tac ctt gca 124 8 
He Ser Asn Asp Ser Thr Phe Thr Gin Tyr Arg Ser Trp Tyr Leu Ala 
405 410 415 

tac aac tac ggc gac cct cag acc gga ate cgc tea tgg acc ctg ctt 12 96 
Tyr Asn Tyr Gly Asp Pro Gin Thr Gly He Arg Ser Trp Thr Leu Leu 
42 0 42 5 43 0 

tgc act cct gac gta acc tgc ggc teg gag cag gtc tac tgg teg ttg 1344 
Cys Thr Pro Asp Val Thr Cys Gly Ser Glu Gin Val Tyr Trp Ser Leu 
435 440 445 

cca gac atg atg caa gac ccc gtg acc ttc cgc tec acg cgc cag ate 1392 
Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin lie 
450 455 460 

age aac ttt ccg gtg gtg ggc gee gag ctg ttg ccc gtg cac tec aag 144 0 
Ser Asn Phe Pro Val Val Gly Ala Glu Leu Leu Pro Val His Ser Lys 
465 470 475 480 

age ttc tac aac gac cag gee gtc tac tec caa etc ate cgc cag ttt 1488 
Ser Phe Tyr Asn Asp Gin Ala Val Tyr Ser Gin Leu lie Arg Gin Phe 
485 490 495 

acc tct ctg acc cac gtg ttc aat cgc ttt ccc gag aac cag att ttg 1536 
Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He Leu 
500 505 510 

gcg cgc ccg cca gee ccc acc ate acc acc gtc agt gaa aac gtt cct 1584 
Ala Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro 
515 520 525 

get etc aca gat cac ggg acg eta ccg ctg cgc aac age ate gga gga 163 2 
Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Asn Ser He Gly Gly 
530 535 540 

gtc cag cga gtg acc att act gac gec aga cgc cgc acc tgc ccc tac 1680 
Val Gin Arg Val Thr lie Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr 
545 550 555 560 

gtt tac aag gee ctg ggc ata gtc teg ccg cgc gtc eta teg age cgc 172 8 
Val Tyr Lys Ala Leu Gly He Val Ser Pro Arg Val Leu Ser Ser Arg 
565 570 575 

act ttt tga 173 7 

Thr Phe * 



<210> 26 
<211> 577 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad5 pent on 



<400> 26 

Arg Arg Ala Ala Met Tyr Glu Glu Gly Pro Pro Pro Ser Tyr Glu Ser 



WO 2004/099422 



PCT/US2004/009219 



-47- 



Val 


Val 


Ser 


Ala 


Ala 


Pro 


Val 


Ala 








20 










Ala 


Pro 


Leu Asp 


Pro 


Pro 


Phe 


Val 






35 










40 


Gly Gly 


Arg Asn 


Ser 


He 


Arg 


Tyr 




50 










55 




Thr 


Thr 


Arg 


Val 


Tyr 


Leu 


Val 


Asp 


65 










70 






Leu 


Asn 


Tyr 


Gin 


Asn 


Asp 


His 


Ser 










85 








Asn 


Asn 


Asp 


Tyr 


Ser 


Pro 


Gly 


Glu 








100 










Asp 


Asp 


Arg 


Ser 


His 


Trp 


Gly 


Gly 






115 










120 


Asn 


Met 


Pro 


Asn 


Val 


Asn 


Glu 


Phe 




130 










135 




Arg 


Val 


Met 


Val 


Ser 


Arg 


Leu 


Pro 


145 










150 






Lys 


Tyr 


Glu 


Trp 


Val 


Glu 


Phe 


Thr 








165 








Thr 


Met 


Thr 


He 


Asp 


Leu 


Met 


Asn 








180 










Lys 


Val 


Gly Arg Gin 


Asn Gly Val 






195 










200 


Phe 


Asp 


Thr Arg Asn 


Phe 


Arg 


Leu 




210 










215 




Val 


Met 


Pro Gly Val 


Tyr 


Thr 


Asn 


225 










230 






Leu 


Leu 


Pro 


Gly Cys 


Gly Val 


Asp 










245 








Leu 


Leu 


Gly He Arg 


Lys 


Arg 


Gin 








260 










Thr 


Tyr 


Asp 


Asp 


Leu 


Glu Gly Gly 






275 










280 


Asp 


Ala 


Tyr 


Gin 


Ala 


Ser 


Leu 


Lys 




290 










295 




Gly Ala 


Gly Gly 


Ser 


Asn 


Ser 


Ser 


305 










310 






Asn 


Ala 


Ala 


Ala 


Ala 


Ala 


Met 


Gin 










325 








Arg Gly 


Tyr 


Pro 


Tyr 


Asp 


Val 


Pro 








340 










Arg Ala 


Glu 


Glu 


Lys 


Arg 


Ala 


Glu 






355 










360 


Ala 


Pro 


Ala 


Ala 


Gin 


Pro 


Glu 


Val 




370 










375 




He 


Lys 


Pro 


Leu 


Thr 


Glu Asp 


Ser 


385 








390 






Ser 


Asn 


Asp 


Ser 


Thr 


Phe 


Thr 


Gin 










405 








Asn 


Tyr 


Gly 


Asp 


Pro 


Gin 


Thr 


Gly 








420 










Thr 


Pro 


Asp 


Val 


Thr 


Cys 


Gly 


Ser 






435 










440 


Asp 


Met 


Met 


Gin 


Asp 


Pro 


Val 


Thr 


450 










455 




Asn 


Phe 


Pro 


Val 


Val 


Gly Ala 


Glu 


465 










470 






Phe 


Tyr 


Asn Asp 


Gin 


Ala 


Val 


Tyr 










485 











10 










15 




Ala 


Ala 


Leu Gly 


Ser 


Pro 


Phe 


Asp 


25 










30 






Pro 


Pro 


Arg 


Tyr 


Leu Arg 


Pro 


Thr 










45 








Ser 


Glu 


Leu 


Ala 


Pro 


Leu 


Phe 


Asp 








60 










Asn 


Lys 


Ser 


Thr 


Asp 


Val 


Ala 


Ser 






75 










80 


Asn 


Phe 


Leu 


Thr 


Thr 


Val 


He 


Gin 




90 










95 




Ala 


Ser 


Thr 


Gin 


Thr 


He 


Asn 


Leu 


105 










110 






Asp 


Leu 


Lys 


Thr 


He 


Leu 


His 


Thr 










125 








Met 


Phe 


Thr 


Asn 


Lys 


Phe 


Lys 


Ala 








140 










Thr 


Lys 


Asp 


Asn 


Gin 


Val 


Glu 


Leu 




155 










160 


Leu 


Pro 


Glu 


Gly 


Asn 


Tyr 


Ser 


Glu 




170 










175 




Asn 


Ala 


He 


Val 


Glu 


His 


Tyr 


Leu 


185 










190 






Leu 


Glu 


Ser Asp 


He 


Gly Val 


Lys 










205 








Gly 


Phe 


Asp 


Pro 


Val 


Thr 


Gly 


Leu 








220 










Glu 


Ala 


Phe 


His 


Pro 


Asp 


He 


He 






235 










240 


Phe 


Thr 


His 


Ser 


Arg 


Leu 


Ser 


Asn 




250 










255 




Pro 


Phe 


Gin 


Glu 


Gly 


Phe 


Arg 


He 


265 










270 






Asn 


He 


Pro 


Ala 


Leu 


Leu 


Asp 


Val 










285 








Asp 


Asp 


Thr 


Glu 


Gin Gly Gly 


Gly 








300 










Gly Ser 


Gly Ala 


Glu 


Glu 


Asn 


Ser 






315 










320 


Pro 


Val 


Glu Asp 


Met 


Asn 


Asp 


Ser 




330 










335 




Asp 


Tyr 


Ala Gly 


Thr 


Ser 


Ala 


Thr 


345 










350 






Ala 


Glu 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 










365 








Glu 


Lys 


Pro 


Gin 


Lys 


Lys 


Pro 


Val 








380 










Lys 


Lys 


Arg 


Ser 


Tyr 


Asn 


Leu 


He 






395 










400 


Tyr 


Arg 


Ser 


Trp 


Tyr 


Leu 


Ala 


Tyr 




410 










415 




He 


Arg 


Ser 


Trp 


Thr 


Leu 


Leu 


Cys 


425 










430 






Glu 


Gin 


Val 


Tyr 


Trp 


Ser 


Leu 


Pro 










445 








Phe 


Arg 


Ser 


Thr 


Arg 


Gin 


He 


Ser 








460 










Leu 


Leu 


Pro 


Val 


His 


Ser 


Lys 


Ser 






475 










480 


Ser 


Gin 


Leu 


He 


Arg 


Gin 


Phe 


Thr 




490 










495 
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Ser Leu Thr 


His 


Val 


Phe 


Asn 


Arg 


Phe 


Pro 


Glu 


Asn 


Gin 


He Leu 


Ala 




500 








505 










510 




Arg Pro Pro 


Ala 


Pro 


Thr 


He 


Thr 


Thr 


Val 


Ser 


Glu 


Asn 


Val Pro 


Ala 


515 










520 










525 






Leu Thr Asp His 


Gly 


Thr 


Leu 


Pro 


Leu 


Arg 


Asn 


Ser 


He Gly Gly 


Val 


530 








535 










540 








Gin Arg Val 


Thr 


He 


Thr 


Asp 


Ala 


Arg 


Arg 


Arg 


Thr 


Cys 


Pro Tyr 


Val 


545 






550 










555 








560 


Tyr Lys Ala 


Leu 


Gly 
565 


He 


Val 


Ser 


Pro 


Arg 
570 


Val 


Leu 


Ser 


Ser Arg 
575 


Thr 



Phe 



<210> 27 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 

<221> CDS 

<222> (1) . . . (1773) 

<400> 27 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 4 8 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 96 
Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
20 25 30 

ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct 144 
Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 
35 40 45 

ttg cgc eta tec gaa cct eta gtt acc tec aat ggc atg ctt gcg etc 192 
Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 
50 55 60 

aaa atg ggc aac ggc etc tct ctg gac gag gec ggc aac ctt acc tec 24 0 
Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

caa aat gta acc act gtg age cca cct etc aaa aaa acc aag tea aac 2 88 
Gin Asn Val Thr Thr Val Ser Pro Pro Leu Lys Lys Thr Lys Ser Asn 
85 90 ~ ~ 95 

ata aac ctg gaa ata tct gca ccc etc aca gtt acc tea gaa gec eta 336 
He Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 
100 105 110 

act gtg get gec gec gca cct eta atg gtc gcg ggc aac aca etc acc 3 84 
Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 
115 120 125 

atg caa tea cag gec ccg eta acc gtg cac gac tec aaa ctt age att 432 
Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser He 
130 135 140 
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gcc acc caa gga ccc etc aca gtg tea gaa gga aag eta gec ctg caa 480 
Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

aca tea ggc ccc etc acc acc acc gat age agt acc ctt act ate act 528 
Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 
165 170 175 

gee tea ccc cct eta act act gee act ggt age ttg ggc att gac ttg 576 
Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 
180 185 ~ 190 

aaa gag ccc att tat aca caa aat gga aaa eta gga eta aag tac ggg 624 
Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 
195 2 00 2 05 

get cct ttg cat gta aca gac gac eta aac act ttg acc gta gca act 672 
Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
210 215 220 

ggt cca ggt gtg act att aat aat act tec ttg caa act aaa gtt act 72 0 
Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

gga gee ttg ggt ttt gat tea caa ggc aat atg caa ctt aat gta gca 76 8 
Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 
245 250 255 

99 a 99 a eta agg att gat tct caa aac aga cgc ctt ata ctt gat gtt 816 
Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 
260 265 270 

agt tat ccg ttt gat get caa aac caa eta aat eta aga eta gga cag 864 
Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 
275 280 285 

ggc cct ctt ttt ata aac tea gee cac aac ttg gat att aac tac aac 912 
Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 
290 295 300 

aaa ggc ctt tac ttg ttt aca get tea aac aat tec aaa aag ctt gag 960 
Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 " 320 

gtt aac eta age act gee aag ggg ttg atg ttt gac get aca gee ata 10 0 8 
Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 
325 330 335 

gec att aat gca gga gat ggg ctt gaa ttt ggt tea cct aat gca cca 1056 
Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 
340 345 ~ 350 

aac aca aat ccc etc aaa aca aaa att ggc cat ggc eta gaa ttt gat 1104 
Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 
355 360 365 

tea aac aag get atg gtt cct aaa eta gga act ggc ctt agt ttt gac 1152 
Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
370 375 380 
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agc aca ggt gcc att aca gta gga aac aaa aat aat gat aag eta act 
Ser Thr Gly Ala lie Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 



1200 



ttg tgg acc gga ata aac cct cca cct aac tgt caa att gtg gaa aac 
Leu Trp Thr Gly lie Asn Pro Pro Pro Asn Cys Gin lie Val Glu Asn 
405 410 415 



1248 



act aat aca aat gat ggc aaa ctt act tta gta tta gta aaa aat gga 
Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val Leu Val Lys Asn Gly 
42 0 42 5 43 0 



1296 



ggg ctt gtt aat ggc tac gtg tct eta gtt ggt gta tea gac act gtg 
Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly Val Ser Asp Thr Val 
435 440 445 



1344 



aac caa atg ttc aca caa aag aca gca aac ate caa tta aga tta tat 
Asn Gin Met Phe Thr Gin Lys Thr Ala Asn lie Gin Leu Arg Leu Tyr 
450 455 460 



1392 



ttt gac tct tct gga aat eta tta act gag gaa tea gac tta aaa att 
Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu Ser Asp Leu Lys lie 
465 470 475 480 



1440 



cca ctt aaa aat aaa tct tct aca gcg acc agt gaa act gta gcc age 
Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser Glu Thr Val Ala Ser 
485 490 495 



1488 



age aaa gcc ttt atg cca agt act aca get tat ccc ttc aac acc act 
Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr Pro Phe Asn Thr Thr 
500 505 510 



1536 



act agg gat agt gaa aac tac att cat gga ata tgt tac tac atg act 
Thr Arg Asp Ser Glu Asn Tyr lie His Gly lie Cys Tyr Tyr Met Thr 
515 520 525 



1584 



agt tat gat aga agt eta ttt ccc ttg aac att tct ata atg eta aac 
Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn lie Ser lie Met Leu Asn 
530 535 540 



1632 



age cgt atg att tct tec aat gtt gcc tat gcc ata caa ttt gaa tgg 
Ser Arg Met lie Ser Ser Asn Val Ala Tyr Ala He Gin Phe Glu Trp 
545 550 555 560 



1680 



aat eta aat gca agt gaa tct cca gaa age aac ata get acg ctg acc 
Asn Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn He Ala Thr Leu Thr 
565 570 575 



1728 



aca tec ccc ttt ttc ttt tct tac att aca gaa gac gac gaa taa 
Thr Ser Pro Phe Phe Phe Ser Tyr He Thr Glu Asp Asp Glu * 
580 " 585 ~ 590 



1773 



<210> 28 
<211> 590 
<212> PRT 
<213> Artificial 



Sequence 



<220> 

<223> 5TS35H 
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<400> 28 












Met 


Lys 


Arg Ala Arg 


Pro 


Ser 


Glu 


1 








5 








Tyr 


Asp 


Thr Glu Thr Gly Pro 


Pro 








20 










Phe 


Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 






35 










40 


Leu 


Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 




50 










55 




Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


65 










70 






Gin 


Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 










85 








He 


Asn 


Leu 


Glu 


He 


Ser 


Ala 


Pro 








100 










Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 






115 










120 


Met 


Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 




130 










135 




Ala 


Thr 


Gin Gly 


Pro 


Leu 


Thr 


Val 


145 










150 






Thr 


Ser 


Gly Pro 


Leu 


Thr 


Thr 


Thr 










165 








Ala 


Ser 


Pro 


Pro 


Leu 


Thr 


Thr 


Ala 








180 










Lys 


Glu 


Pro 


He 


Tyr 


Thr 


Gin 


Asn 






195 










200 


Ala 


Pro 


Leu 


His 


Val 


Thr 


Asp 


Asp 




210 










215 


Gly 


Pro 


Gly Val 


Thr 


He 


Asn 


Asn 


225 










230 






Gly Ala 


Leu 


Gly 


Phe 


As P 


Ser 


Gin 










245 








Gly Gly 


Leu 


Arg 


He 


Asp 


Ser 


Gin 








260 










Ser 


Tyr 


Pro 


Phe 


Asp 


Ala 


Gin 


Asn 






275 










280 


Gly 


Pro 


Leu 


Phe 


He 


Asn 


Ser 


Ala 




290 










295 




Lys 


Gly 


Leu 


Tyr 


Leu 


Phe 


Thr 


Ala 


305 










310 






Val 


Asn 


Leu 


Ser 


Thr 


Ala 


Lys 


Gly 










325 




Ala 


He 


Asn Ala Gly Asp 


Gly 


Leu 








340 










Asn 


Thr 


Asn 


Pro 


Leu 


Lys 


Thr 


Lys 






355 










360 


Ser 


Asn 


Lys 


Ala 


Met 


Val 


Pro 


Lys 




370 










375 


Ser 


Thr 


Gly Ala 


He 


Thr 


Val 


Gly 


385 










390 






Leu 


Trp 


Thr 


Gly 


He 


Asn 


Pro 


Pro 










405 








Thr 


Asn 


Thr 


Asn Asp 


Gly 


Lys 


Leu 








420 










Gly 


Leu 


Val 


Asn 


Gly 


Tyr 


Val 


Ser 






435 










440 


Asn 


Gin 


Met 


Phe 


Thr 


Gin 


Lys 


Thr 




450 










455 




Phe 


Asp 


Ser 


Ser 


Gly Asn 


Leu 


Leu 


465 










470 







Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 




10 










15 




Thr 


Val 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


25 










30 






Glu 


Ser 


Pro 


Pro 


Gly Val 


Leu 


Ser 










45 








Thr 


Ser 


Asn 


Gly 


Met 


Leu 


Ala 


Leu 








60 










Asp 


Glu 


Ala 


Gly Asn 


Leu 


Thr 


Ser 






75 










80 


Pro 


Leu 


Lys 


Lys 


Thr 


Lys 


Ser 


Asn 




90 










95 




Leu 


Thr 


Val 


Thr 


Ser 


Glu 


Ala 


Leu 


105 










110 






Met 


Val 


Ala 


Gly Asn 


Thr 


Leu 


Thr 










125 








Val 


His 


Asp 


Ser 


Lys 


Leu 


Ser 


He 








140 










Ser 


Glu 


Gly 


Lys 


Leu 


Ala 


Leu 


Gin 






155 










160 


Asp 


Ser 


Ser 


Thr 


Leu 


Thr 


He 


Thr 




170 










175 




Thr Gly 


Ser 


Leu Gly 


He 


Asp 


Leu 


185 










190 






Gly Lys 


Leu 


Gly Leu 


Lys 


Tyr 


Gly 










205 








Leu 


Asn 


Thr 


Leu 


Thr 


Val 


Ala 


Thr 








220 










Thr 


Ser 


Leu 


Gin 


Thr 


Lys 


Val 


Thr 






235 








240 


Gly Asn 


Met 


Gin 


Leu 


Asn 


Val 


Ala 




250 










255 




Asn Arg 


Arg 


Leu 


He 


Leu 


Asp 


Val 


265 










270 






Gin 


Leu 


Asn 


Leu 


Arg 


Leu 


Gly 


Gin 










285 








His 


Asn 


Leu 


Asp 


He 


Asn 


Tyr 


Asn 








300 










Ser 


Asn 


Asn 


Ser 


Lys 


Lys 


Leu 


Glu 






315 










320 


Leu 


Met 


Phe 


Asp 


Ala 


Thr 


Ala 


He 




330 










335 




Glu 


Phe 


Gly 


Ser 


Pro 


Asn 


Ala 


Pro 


345 










350 






He 


Gly 


His 


Gly 


Leu 


Glu 


Phe 


Asp 










365 








Leu 


Gly 


Thr 


Gly 
380 


Leu 


Ser 


Phe 


Asp 


Asn 


Lys 


Asn 


Asn 


Asp 


Lys 


Leu 


Thr 






395 










400 


Pro 


Asn 


Cys 


Gin 


He 


Val 


Glu 


Asn 




410 










415 




Thr 


Leu 


Val 


Leu 


Val 


Lys 


Asn 


Gly 


425 










430 




Leu 


Val 


Gly 


Val 


Ser 


Asp 


Thr 


Val 










445 








Ala 


Asn 


He 


Gin 


Leu 


Arg 


Leu 


Tyr 








460 










Thr 


Glu 


Glu 


Ser Asp 


Leu 


Lys 


He 






475 










480 
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Pro Leu 


Lys 


Asn 


Lys 


Ser 


Ser 


Thr 


Ala 


Thr 


Ser 


Glu 


Thr 


Val 


Ala 


Ser 






485 










490 










495 




Ser Lys 


Ala 


Phe 


Met 


Pro 


Ser 


Thr 


Thr 


Ala 


Tyr 


Pro 


Phe 


Asn 


Thr 


Thr 




500 










505 










510 






Thr Arg 


Asp 


Ser 


Glu 


Asn 


Tyr 


He 


His 


Gly 


He 


Cys 


Tyr 


Tyr 


Met 


Thr 


515 










520 










525 








Ser Tyr 


Asp 


Arg 


Ser 


Leu 


Phe 


Pro 


Leu 


Asn 


He 


Ser 


He 


Met 


Leu 


Asn 


530 






535 










540 










Ser Arg 


Met 


He 


Ser 


Ser 


Asn Val Ala Tyr Ala 


He 


Gin 


Phe 


Glu 


Trp 


545 








550 










555 










560 


Asn Leu 


Asn 


Ala 


Ser 


Glu 


Ser 


Pro 


Glu 


Ser 


Asn 


He 


Ala 


Thr 


Leu 


Thr 








565 










570 










575 




Thr Ser 


Pro 


Phe 


Phe 


Phe 


Ser 


Tyr 


He 


Thr 


Glu Asp 


Asp 


Glu 







580 585 590 



<210> 29 
<211> 945 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 35TS5H 

<221> CDS 

<222> (1) . . . (945) 

<400> 29 

atg acc aag aga gtc egg etc agt gac tec ttc aac cct gtc tac ccc 4 8 
Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 
1 ~ 5 10 15 



tat gaa gat gaa age acc tec caa cac ccc ttt ata aac cca ggg ttt 
Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe He Asn Pro Gly Phe 
20 25 30 



aaa tgt tta acc cca eta aca acc aca ggc gga tct eta cag eta aaa 
Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 
50 55 60 



96 



att tec cca aat ggc ttc aca caa age cca gac gga gtt ctt act tta 144 
He Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 40 45 



192 



gtg gga ggg gga ctt aca gtg gat gac act gat ggt acc tta caa gaa 240 

Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 

65 " 70 75 80 

aac ata cgt get aca gca ccc att act aaa aat aat cac tct gta gaa 288 

Asn He Arg Ala Thr Ala Pro He Thr Lys Asn Asn His Ser Val Glu 

85 90 95 

eta tec att gga aat gga tta gaa act caa aac aat aaa eta tgt gec 33 6 

Leu Ser He Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 

100 105 110 

aaa ttg gga aat ggg tta aaa ttt aac aac ggt gac att tgt ata aag 3 84 

Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp He Cys He Lys 
115 " 12 0 125 

gat agt att aac acc tta tgg act aca cca get cca tct cct aac tgt 432 
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Asp Ser lie Asn Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys 
130 135 140 

aga eta aat gca gag aaa gat get aaa etc act ttg gtc tta aca aaa 4 80 
Arg Leu Asn Ala Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys 
145 150 155 160 

tgt ggc agt caa ata ctt get aca gtt tea gtt ttg get gtt aaa ggc 52 8 
Cys Gly Ser Gin lie Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly 
165 170 175 

agt ttg get cca ata tct gga aca gtt caa agt get cat ctt att ata 576 
Ser Leu Ala Pro lie Ser Gly Thr Val Gin Ser Ala His Leu He He 
180 185 190 

aga ttt gac gaa aat gga gtg eta eta aac aat tec ttc ctg gac cca 624 
Arg Phe Asp Glu Asn Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro 
195 200 205 

gaa tat tgg aac ttt aga aat gga gat ctt act gaa ggc aca gee tat 672 
Glu Tyr Trp Asn Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr 
210 215 220 

aca aac get gtt gga ttt atg cct aac eta tea get tat cca aaa tct 72 0 
Thr Asn Ala Val Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser 
225 230 235 240 

cac ggt aaa act gee aaa agt aac att gtc agt caa gtt tac tta aac 768 
His Gly Lys Thr Ala Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn 
245 250 255 

gga gac aaa act aaa cct gta aca eta acc att aca eta aac ggt aca 816 
Gly Asp Lys Thr Lys Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr 
260 265 270 

cag gaa aca gga gac aca act cca agt gca tac tct atg tea ttt tea 864 
Gin Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser 
275 280 285 

tgg gac tgg tct ggc cac aac tac att aat gaa ata ttt gee aca tec 912 
Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu lie Phe Ala Thr Ser 
290 295 300 

tct tac act ttt tea tac att gee caa gaa taa 945 
Ser Tyr Thr Phe Ser Tyr He Ala Gin Glu * 
305 310 



<210> 30 
<211> 314 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 35TS5H 
<400> 30 

Met Thr Lys Arg Val Arg Leu Ser 

1 5 
Tyr Glu Asp Glu Ser Thr Ser Gin 
20 



Asp Ser Phe Asn Pro Val Tyr Pro 

10 15 

His Pro Phe He Asn Pro Gly Phe 
25 30 
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lie 


Ser 


Pro 


Asn Gly 


Phe 


Thr 


Gin 


Ser 


Pro 


Asp 


Gly Val 


Leu 


Thr 


Leu 






35 










40 










45 








Lys 


Cys 


Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


Thr 


Gly Gly 


Ser 


Leu 


Gin 


Leu 


Lys 




50 










55 










60 








Val 


Gly 


Gly 


Gly Leu Thr 


Val 


Asp 


Asp 


Thr Asp Gly Thr Leu 


Gin 


Glu 


65 










70 










75 










80 


Asn 


lie 


Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr 


Lys 


Asn 


Asn 


His 


Ser 


Val 


Glu 










85 










90 










95 




Leu 


Ser 


lie 


Gly Asn Gly 


Leu 


Glu 


Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 








100 










105 








110 




Lys 


Leu 


Gly 


Asn Gly Leu 


Lys 


Phe Asn Asn Gly Asp 


He 


Cys 


He 


Lys 






115 










120 










125 








Asp 


Ser 


lie 


Asn 


Thr 


Leu 


Trp 


Thr 


Thr 


Pro 


Ala 


Pro 


Ser 


Pro 


Asn 


Cys 




130 










135 










140 








Arg 


Leu 


Asn 


Ala 


Glu 


Lys 


Asp 


Ala 


Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Thr 


Lys 


145 










150 










155 










160 


Cys 


Gly 


Ser 


Gin 


lie 


Leu 


Ala 


Thr 


Val 


Ser 


Val 


Leu 


Ala 


Val 


Lys 


Gly 










165 










170 










175 


Ser 


Leu 


Ala 


Pro 


lie 


Ser 


Gly 


Thr 


Val 


Gin 


Ser 


Ala 


His 


Leu 


He 


He 








180 










185 










190 






Arg 


Phe 


Asp 


Glu 


Asn 


Gly 


Val 


Leu 


Leu 


Asn 


Asn 


Ser 


Phe 


Leu 


Asp 


Pro 






195 










200 










205 






Glu 


Tyr 


Trp 


Asn 


Phe 


Arg 


Asn Gly Asp 


Leu 


Thr 


Glu 


Gly 


Thr 


Ala 


Tyr 




210 










215 










220 






Thr 


Asn 


Ala 


Val 


Gly 


Phe 


Met 


Pro 


Asn 


Leu 


Ser 


Ala 


Tyr 


Pro 


Lys 


Ser 


225 










230 










235 






240 


His 


Gly 


Lys 


Thr 


Ala 


Lys 


Ser 


Asn 


He 


Val 


Ser 


Gin 


Val 


Tyr 


Leu 


Asn 










245 










250 








255 




Gly Asp 


Lys 


Thr 


Lys 


Pro 


Val 


Thr 


Leu 


Thr 


He 


Thr 


Leu 


Asn 


Gly 


Thr 








260 










265 










270 




Gin 


Glu 


Thr 


Gly Asp 


Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


Ser 


Met 


Ser 


Phe 


Ser 






275 










280 










285 








Trp 


Asp 


Trp 


Ser 


Gly 


His 


Asn 


Tyr 


He 


Asn 


Glu 


He 


Phe 


Ala 


Thr 


Ser 




290 










295 










300 










Ser 


Tyr 


Thr 


Phe 


Ser 


Tyr 


lie 


Ala 


Gin 


Glu 














305 










310 























<210> 31 
<211> 1098 
<212> DNA 

<213> Adenovirus type 3 7 

<220> 

<221> CDS 

<222> (1) . . . (1098) 

<400> 31 

atg tea aag agg etc egg gtg gaa gat gac ttc aac ccc gtc tac ccc 4 8 

Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro 
15 10 15 

tat ggc tac gcg egg aat cag aat ate ccc ttc etc act ccc ccc ttt 96 
Tyr Gly Tyr Ala Arg Asn Gin Asn He Pro Phe Leu Thr Pro Pro Phe 
20 25 30 

gtc tec tec gat gga ttc aaa aac ttc ccc cct ggg gta ctg tea etc 144 
Val Ser Ser Asp Gly Phe Lys Asn Phe Pro Pro Gly Val Leu Ser Leu 
35 40 45 



aaa ctg get gat cca ate acc att acc aat ggg gat gta tec etc aag 192 
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Lys Leu Ala Asp Pro lie Thr He Thr Asn Gly Asp Val Ser Leu Lys 
50 55 60 

gtg gga ggt ggt etc act ttg caa gat gga age eta act gta aac cct 24 0 
Val Gly Gly Gly Leu Thr Leu Gin Asp Gly Ser Leu Thr Val Asn Pro 
65 70 " 75 80 

aag get cca ctg caa gtt aat act gat aaa aaa ctt gag ctt gca tat 2 88 
Lys Ala Pro Leu Gin Val Asn Thr Asp Lys Lys Leu Glu Leu Ala Tyr 
85 90 ~ 95 

gat aat cca ttt gaa agt agt get aat aaa ctt agt tta aaa gta gga 33 6 
Asp Asn Pro Phe Glu Ser Ser Ala Asn Lys Leu Ser Leu Lys Val Gly 
100 105 110 

cat 99& tta aaa gta tta gat gaa aaa agt get gcg ggg tta aaa gat 3 84 
His Gly Leu Lys Val Leu Asp Glu Lys Ser Ala Ala Gly Leu Lys Asp 
115 120 125 

tta att ggc aaa ctt gtg gtt tta aca gga aaa gga ata ggc act gaa 432 
Leu He Gly Lys Leu Val Val Leu Thr Gly Lys Gly He Gly Thr Glu 
130 135 140 

aat tta gaa aat aca gat ggt age age aga gga att ggt ata aat gta 4 80 
Asn Leu Glu Asn Thr Asp Gly Ser Ser Arg Gly He Gly He Asn Val 
145 150 155 160 

aga gca aga gaa ggg ttg aca ttt gac aat gat gga tac ttg gta gca 52 8 
Arg Ala Arg Glu Gly Leu Thr Phe Asp Asn Asp Gly Tyr Leu Val Ala 
165 170 175 

tgg aac cca aag tat gac acg cgc aca ctt tgg aca aca cca gac aca 576 
Trp Asn Pro Lys Tyr Asp Thr Arg Thr Leu Trp Thr Thr Pro Asp Thr 
180 185 190 

tct cca aac tgc aca att get caa gat aag gac tct aaa etc act ttg 624 
Ser Pro Asn Cys Thr He Ala Gin Asp Lys Asp Ser Lys Leu Thr Leu 
195 200 205 

gta ctt aca aag tgt gga agt caa ata tta get aat gtg tct ttg att 672 
Val Leu Thr Lys Cys Gly Ser Gin He Leu Ala Asn Val Ser Leu He 
210 215 220 

gtg gtc gca gga aag tac cac ate ata aat aat aag aca aat cca aaa 72 0 
Val Val Ala Gly Lys Tyr His He He Asn Asn Lys Thr Asn Pro Lys 
225 230 235 240 

ata aaa agt ttt act att aaa ctg eta ttt aat aag aac gga gtg ctt 768 
He Lys Ser Phe Thr He Lys Leu Leu Phe Asn Lys Asn Gly Val Leu 
245 250 ~ 255 

tta gac aac tea aat ctt gga aaa get tat tgg aac ttt aga agt gga 816 
Leu Asp Asn Ser Asn Leu Gly Lys Ala Tyr Trp Asn Phe Arg Ser Gly 
260 265 270 

aat tec aat gtt teg aca get tat gaa aaa gca att ggt ttt atg cct 864 
Asn Ser Asn Val Ser Thr Ala Tyr Glu Lys Ala He Gly Phe Met Pro 
275 280 ~ 285 

aat ttg gta gcg tat cca aaa ccc agt aat tct aaa aaa tat gca aga 912 
Asn Leu Val Ala Tyr Pro Lys Pro Ser Asn Ser Lys Lys Tyr Ala Arg 
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290 295 300 

gac ata gtt tat gga act ata tat ctt ggt gga aaa cct gat cag cca 960 
Asp lie Val Tyr Gly Thr lie Tyr Leu Gly Gly Lys Pro Asp Gin Pro 
305 310 315 320 

gca gtc att aaa act acc ttt aac caa gaa act gga tgt gaa tac tct 100 8 
Ala Val lie Lys Thr Thr Phe Asn Gin Glu Thr Gly Cys Glu Tyr Ser 
325 33 0 ~ 335 

ate aca ttt aac ttt agt tgg tec aaa acc tat gaa aat gtt gaa ttt 1056 
lie Thr Phe Asn Phe Ser Trp Ser Lys Thr Tyr Glu Asn Val Glu Phe 
340 345 ' 350 

gaa acc acc tct ttt acc ttc tec tat att gec caa gaa tga 1098 
Glu Thr Thr Ser Phe Thr Phe Ser Tyr lie Ala Gin Glu * 
355 360 365 



<210> 32 
<211> 365 
<212> PRT 

<213> Adenovirus type 3 7 



<400> 32 



Met 


Ser 


Lys 


Arg 


Leu 


Arg 


Val 


Glu 


Asp 


Asp 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 








5 










10 










15 




Tyr 


Gly 


Tyr 


Ala 


Arg 


Asn 


Gin 


Asn 


He 


Pro 


Phe 


Leu 


Thr 


Pro 


Pro 


Phe 








20 










25 










30 






Val 


Ser 


Ser 


Asp 


Gly 


Phe 


Lys 


Asn 


Phe 


Pro 


Pro 


Gly 


Val 


Leu 


Ser 


Leu 






35 










40 










45 








Lys 


Leu 


Ala 


Asp 


Pro 


He 


Thr 


He 


Thr 


Asn 


Gly 


Asp 


Val 


Ser 


Leu 


Lys 




50 










55 










60 










Val 


Gly 


Gly 


Gly 


Leu 


Thr 


Leu 


Gin 


Asp 


Gly 


Ser 


Leu 


Thr 


Val 


Asn 


Pro 


65 










70 










75 










80 


Lys 


Ala 


Pro 


Leu 


Gin 


Val 


Asn 


Thr 


Asp 


Lys 


Lys 


Leu 


Glu 


Leu 


Ala 


Tyr 










85 










90 










95 




Asp 


Asn 


Pro 


Phe 


Glu 


Ser 


Ser 


Ala 


Asn 


Lys 


Leu 


Ser 


Leu 


Lys 


Val 


Gly 








100 










105 










110 






His 


Gly 


Leu 


Lys 


Val 


Leu 


Asp 


Glu 


Lys 


Ser 


Ala 


Ala 


Gly 


Leu 


Lys 


Asp 






115 










120 










125 








Leu 


He 


Gly 


Lys 


Leu 


Val 


Val 


Leu 


Thr Gly Lys 


Gly 


He 


Gly 


Thr 


Glu 




130 










135 










140 










Asn 


Leu 


Glu 


Asn 


Thr 


Asp 


Gly 


Ser 


Ser Arg Gly 


He 


Gly 


He 


Asn 


Val 


145 










150 










155 










160 


Arg Ala 


Arg 


Glu 


Gly 


Leu 


Thr 


Phe 


Asp 


Asn 


Asp 


Gly 


Tyr 


Leu 


Val 


Ala 










165 










170 










175 




Trp 


Asn 


Pro 


Lys 


Tyr 


Asp 


Thr 


Arg 


Thr 


Leu 


Trp 


Thr 


Thr 


Pro 


Asp 


Thr 








180 










185 










190 






Ser 


Pro 


Asn 


Cys 


Thr 


He 


Ala 


Gin 


Asp 


Lys 


Asp 


Ser 


Lys 


Leu 


Thr 


Leu 






195 










200 










205 








Val 


Leu 


Thr 


Lys 


Cys 


Gly 


Ser 


Gin 


He 


Leu 


Ala 


Asn 


Val 


Ser 


Leu 


He 




210 










215 










220 










Val 


Val 


Ala 


Gly 


Lys 


Tyr 


His 


He 


He 


Asn 


Asn 


Lys 


Thr 


Asn 


Pro 


Lys 


225 










230 










235 










240 


lie 


Lys 


Ser 


Phe 


Thr 


He 


Lys 


Leu 


Leu 


Phe 


Asn 


Lys Asn Gly 


Val 


Leu 








245 










250 










255 




Leu Asp 


Asn 


Ser 


Asn 


Leu 


Gly 


Lys 


Ala 


Tyr 


Trp 


Asn 


Phe 


Arg 


Ser 


Gly 








260 










265 










270 






Asn 


Ser 


Asn 


Val 


Ser 


Thr 


Ala 


Tyr 


Glu 


Lys 


Ala 


He 


Gly Phe 


Met 


Pro 
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275 

Asn Leu Val Ala Tyr 
290 

Asp lie Val Tyr Gly 
305 

Ala Val lie Lys Thr 
325 

lie Thr Phe Asn Phe 
340 

Glu Thr Thr Ser Phe 
355 



280 

Pro Lys Pro Ser Asn Ser 
295 

Thr lie Tyr Leu Gly Gly 
310 315 
Thr Phe Asn Gin Glu Thr 
330 

Ser Trp Ser Lys Thr Tyr 
345 

Thr Phe Ser Tyr lie Ala 
360 



285 

Lys Lys Tyr Ala Arg 
300 

Lys Pro Asp Gin Pro 
320 

Gly Cys Glu Tyr Ser 
335 

Glu Asn Val Glu Phe 
350 

Gin Glu 
365 



<210> 33 
<211> 1098 
<212> DNA 

<213> Adenovirus type 19p 

<220> 

<221> CDS 

<222> (1) . . . (1098) 



20 25 30 

gtc tec tec gat gga ttc aaa aac ttc ccc cct ggg gta ctg tea etc 
Val Ser Ser Asp Gly Phe Lys Asn Phe Pro Pro Gly Val Leu Ser Leu 
35 40 45 

aaa ctg get gat cca ate acc att acc aat ggg gat gta tec etc aag 
Lys Leu Ala Asp Pro He Thr He Thr .Asn Gly Asp Val Ser Leu Lys 
50 ~ 55 60 

gtg gga ggt ggt etc act ttg caa gat gga age eta act gta aac cct 
Val Gly Gly Gly Leu Thr Leu Gin Asp Gly Ser Leu Thr Val Asn Pro 
65 7 0 75 8 0 

aag get cca ctg caa gtt act act gat aaa aaa ctt gag ctt gca tat 
Lys Ala Pro Leu Gin Val Thr Thr Asp Lys Lys Leu Glu Leu Ala Tyr 
85 90 95 

gat aat cca ttt gaa tgt agt get aat aaa ttt agt tta aaa gta gga 
Asp Asn Pro Phe Glu Cys Ser Ala Asn Lys Phe Ser Leu Lys Val Gly 
100 105 110 



48 



<400> 33 

atg tea aag agg etc egg gtg gaa gat gac ttc aac ccc gtc tac ccc 
Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro 
15 10 15 

tat ggc tac gcg egg aat cag aat ate ccc ttc etc act ccc ccc ttt 96 
Tyr Gly Tyr Ala Arg Asn Gin Asn He Pro Phe Leu Thr Pro Pro Phe 



144 



192 



240 



288 



336 



cat gga tta aaa gta tta gat gaa aaa agt get gcg ggg tta aaa gat 3 84 
His Gly Leu Lys Val Leu Asp Glu Lys Ser Ala Ala Gly Leu Lys Asp 
115 120 125 

tta att ggc aaa ctt gtg gtt tta aca gga aaa gga ata ggc act gaa 432 
Leu He Gly Lys Leu Val Val Leu Thr Gly Lys Gly He Gly Thr Glu 
130 " 135 140 



aat tta gaa aat aca gat ggt age age aga gga att ggt ata aat gta 
Asn Leu Glu Asn Thr Asp Gly Ser Ser Arg Gly He Gly He Asn Val 



480 



WO 2004/099422 



PCT/US2004/009219 



-58- 



145 150 155 160 

aga gca aga gaa ggg ttg aca ttt gac aat gat gga tac ttg gta gca 52 8 
Arg Ala Arg Glu Gly Leu Thr Phe Asp Asn Asp Gly Tyr Leu Val Ala 
165 170 175 

tgg aac cca aag tat gac acg cgc aca ctt tgg aca aca cca gac aca 5 76 
Trp Asn Pro Lys Tyr Asp Thr Arg Thr Leu Trp Thr Thr Pro Asp Thr 
180 185 190 

tct cca aac tgc aca att get cag gat aag gac tct aaa etc act ttg 624 
Ser Pro Asn Cys Thr lie Ala Gin Asp Lys Asp Ser Lys Leu Thr Leu 
195 200 205 

gta ctt aca aag tgt gga agt caa ata tta get aat gtg tct ttg att 672 
Val Leu Thr Lys Cys Gly Ser Gin lie Leu Ala Asn Val Ser Leu lie 
210 " 215 220 

gtg gtc gca gga aag tac cac ate ata aat aat aag aca aat cca gaa 72 0 
Val Val Ala Gly Lys Tyr His lie lie Asn Asn Lys Thr Asn Pro Glu 
225 230 235 240 

ata aaa agt ttt act att aaa ctg tta ttt aat aag aac gga gtg ctt 7 68 
He Lys Ser Phe Thr He Lys Leu Leu Phe Asn Lys Asn Gly Val Leu 
245 ~ 250 255 

tta gac aac tea aat ctt gga aaa get tat tgg aac ttt aga agt gga 816 
Leu Asp Asn Ser Asn Leu Gly Lys Ala Tyr Trp Asn Phe Arg Ser Gly 
260 265 270 



aat tec aat gtt teg aca get tat gaa aaa gca att ggt ttt atg cct 
Asn Ser Asn Val Ser Thr Ala Tyr Glu Lys Ala He Gly Phe Met Pro 
275 280 285 



gca gtc att aaa act ace ttt aac caa gaa act gga tgt gaa tac tct 
Ala Val He Lys Thr Thr Phe Asn Gin Glu Thr Gly Cys Glu Tyr Ser 
325 330 335 



gaa ace ace tct ttt ace ttc tec tat att gee caa gaa tga 
Glu Thr Thr Ser Phe Thr Phe Ser Tyr He Ala Gin Glu * 
355 360 365 



<210> 34 
<211> 365 
<212> PRT 

<213> Adenovirus type 19p 



864 



aat tta gta gcg tat cca aaa ccc agt aat tct aaa aaa tat gca aga 912 
Asn Leu Val Ala Tyr Pro Lys Pro Ser Asn Ser Lys Lys Tyr Ala Arg 
290 295 300 

gac ata gtt tat gga act ata tat ctt ggt gga aaa cct gat cag cca 960 
Asp He Val Tyr Gly Thr He Tyr Leu Gly Gly Lys Pro Asp Gin Pro 
305 310 315 320 



1008 



ate aca ttt gac ttt agt tgg tec aaa ace tat gaa aat gtt gaa ttt 1056 
He Thr Phe Asp Phe Ser Trp Ser Lys Thr Tyr Glu Asn Val Glu Phe 
340 345 350 



1098 
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<400> 34 

Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro 

15 10 15 

Tyr Gly Tyr Ala Arg Asn Gin Asn lie Pro Phe Leu Thr Pro Pro Phe 

20 25 30 

Val Ser Ser Asp Gly Phe Lys Asn Phe Pro Pro Gly Val Leu Ser Leu 

35 40 45 

Lys Leu Ala Asp Pro lie Thr lie Thr Asn Gly Asp Val Ser Leu Lys 

50 55 60 

Val Gly Gly Gly Leu Thr Leu Gin Asp Gly Ser Leu Thr Val Asn Pro 
65 ' 70 75 80 

Lys Ala Pro Leu Gin Val Thr Thr Asp Lys Lys Leu Glu Leu Ala Tyr 

85 90 95 

Asp Asn Pro Phe Glu Cys Ser Ala Asn Lys Phe Ser Leu Lys Val Gly 

100 105 110 

His Gly Leu Lys Val Leu Asp Glu Lys Ser Ala Ala Gly Leu Lys Asp 

115 120 125 

Leu lie Gly Lys Leu Val Val Leu Thr Gly Lys Gly lie Gly Thr Glu 

130 ^ 135 140 

Asn Leu Glu Asn Thr Asp Gly Ser Ser Arg Gly lie Gly lie Asn Val 
145 150 155 160 

Arg Ala Arg Glu Gly Leu Thr Phe Asp Asn Asp Gly Tyr Leu Val Ala 

165 170 175 

Trp Asn Pro Lys Tyr Asp Thr Arg Thr Leu Trp Thr Thr Pro Asp Thr 

180 ~ 185 190 

Ser Pro Asn Cys Thr lie Ala Gin Asp Lys Asp Ser Lys Leu Thr Leu 

195 ~ 200 205 

Val Leu Thr Lys Cys Gly Ser Gin He Leu Ala Asn Val Ser Leu He 

210 215 220 

Val Val Ala Gly Lys Tyr His He He Asn Asn Lys Thr Asn Pro Glu 
225 230 235 240 

He Lys Ser Phe Thr He Lys Leu Leu Phe Asn Lys Asn Gly Val Leu 

245 ~ 250 255 

Leu Asp Asn Ser Asn Leu Gly Lys Ala Tyr Trp Asn Phe Arg Ser Gly 

260 265 270 

Asn Ser Asn Val Ser Thr Ala Tyr Glu Lys Ala He Gly Phe Met Pro 

275 280 285 

Asn Leu Val Ala Tyr Pro Lys Pro Ser Asn Ser Lys Lys Tyr Ala Arg 

290 " 295 300 

Asp He Val Tyr Gly Thr He Tyr Leu Gly Gly Lys Pro Asp Gin Pro 
305 ~ 310 315 320 

Ala Val He Lys Thr Thr Phe Asn Gin Glu Thr Gly Cys Glu Tyr Ser 

325 330 335 

He Thr Phe Asp Phe Ser Trp Ser Lys Thr Tyr Glu Asn Val Glu Phe 

340 345 350 

Glu Thr Thr Ser Phe Thr Phe Ser Tyr He Ala Gin Glu 
355 360 365 



<210> 35 
<211> 1116 
<212> DNA 

<213> Adenovirus type 3 0 

<220> 

<221> CDS 

<222> (1) . . . (1116) 

<400> 35 

atg tea aag agg etc egg gtg gaa gat gac ttc aac ccc gtc tac ccc 4 8 
Met Ser Lys Arg Leu Arg Val Glu Asp Asp Phe Asn Pro Val Tyr Pro 
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15 10 15 

tat ggc tac gcg egg aat cag aat ate ccc ttc ctt act ccc ccc ttt 96 

Tyr Gly Tyr Ala Arg Asn Gin Asn lie Pro Phe Leu Thr Pro Pro Phe 
20 25 30 

gtc tea tec gat gga ttc aaa aac ttc cca ect ggg gtc ctg tea etc 144 

Val Ser Ser Asp Gly Phe Lys Asn Phe Pro Pro Gly Val Leu Ser Leu 
35 40 45 

aaa ctg get gac cca ate gec ate act aat ggg gat gtc tea etc aag 192 

Lys Leu Ala Asp Pro lie Ala lie Thr Asn Gly Asp Val Ser Leu Lys 
50 55 60 

Qtg gga ggg gga eta act gtg gaa caa gat agt gga aac eta agt gta 24 0 

Val Gly Gly Gly Leu Thr Val Glu Gin Asp Ser Gly Asn Leu Ser Val 
65 70 75 ~ 80 

aac cct aag get cca ttg caa gtt gga aca gac aaa aaa ctg gaa ttg 28 8 

Asn Pro Lys Ala Pro Leu Gin Val Gly Thr Asp Lys Lys Leu Glu Leu 

85 90 ~ ' 95 

get tta gca cct cca ttt gat gtc aga gat aac aag eta get att eta 33 6 

Ala Leu Ala Pro Pro Phe Asp Val Arg Asp Asn Lys Leu Ala lie Leu 
100 105 * 110 

gta gga gat gga tta aag gta ata gat aga tea ata tct gat ttg cca 3 84 

Val Gly Asp Gly Leu Lys Val lie Asp Arg Ser lie Ser Asp Leu Pro 

115 120 " ~ 125 

ggt ttg tta aac tat ctt gta gtt ttg act ggc aaa gga att gga aat 43 2 

Gly Leu Leu Asn Tyr Leu Val Val Leu Thr Gly Lys Gly lie Gly Asn 
130 135 140 

gaa gaa tta aaa aat gac gat ggt age aat aaa gga gtc ggt tta tgt 480 

Glu Glu Leu Lys Asn Asp Asp Gly Ser Asn Lys Gly Val Gly Leu Cys 

145 150 155 ~ 160 

gtg aga att gga gaa gga ggt ggt tta act ttt gat gat aaa ggt tat 52 8 

Val Arg lie Gly Glu Gly Gly Gly Leu Thr Phe Asp Asp Lys Gly Tyr 

165 170 175 

tta gta gca tgg aac aat aaa cat gac ate cgc aca ctt tgg aca act 576 

Leu Val Ala Trp Asn Asn Lys His Asp He Arg Thr Leu Trp Thr Thr 
180 185 190 

tta gac cct tct cca aat tgt aag ata gat ata gaa aaa gac tea aaa 624 

Leu Asp Pro Ser Pro Asn Cys Lys He Asp He Glu Lys Asp Ser Lys 
195 200 205 

eta act ttg gta ctg aca aag tgc gga agt cag att ttg gca aat gta 672 

Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He Leu Ala Asn Val 
210 215 220 

tct eta att ata gtc aac gga aag ttc aag ate ctt aat aac aaa aca 72 0 

Ser Leu He He Val Asn Gly Lys Phe Lys He Leu Asn Asn Lys Thr 

225 230 235 240 

gac cca tec eta cct aaa tea ttt aac ate aaa eta ctg ttt gat caa 768 

Asp Pro Ser Leu Pro Lys Ser Phe Asn He Lys Leu Leu Phe Asp Gin 

245 250 255 
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aat gga gtt eta ttg gaa aat tea aac att gaa aaa cag tac eta aac 
Asn Gly Val Leu Leu Glu Asn Ser Asn lie Glu Lys Gin Tyr Leu Asn 
260 265 ^ 270 



816 



ttt aga agt gga gac tct att ctt cca gag cca tat aaa aat gca att 864 
Phe Arg Ser Gly Asp Ser lie Leu Pro Glu Pro Tyr Lys Asn Ala lie 
275 280 285 

gga ttt atg cct aat tta eta get tat get aaa get aca act gat cag 912 
Gly Phe Met Pro Asn Leu Leu Ala Tyr Ala Lys Ala Thr Thr Asp Gin 
290 295 300 

tct aaa att tat gca agg aac act ata tat gga aat ate tac tta gat 960 
Ser Lys lie Tyr Ala Arg Asn Thr He Tyr Gly Asn He Tyr Leu Asp 
305 310 315 " 320 

aat cag cca tat aat cca gtt gta att aaa att act ttt aat aat gaa 1008 
Asn Gin Pro Tyr Asn Pro Val Val He Lys He Thr Phe Asn Asn Glu 
325 330 335 

gca gat agt get tat tct ate act ttt aac tat tea tgg ace aag gac 1056 
Ala Asp Ser Ala Tyr Ser He Thr Phe Asn Tyr Ser Trp Thr Lys Asp 
340 345 350 

tat gac aat ate cct ttt gat tct act tea ttt acc ttc tec tat ate 1104 
Tyr Asp Asn He Pro Phe Asp Ser Thr Ser Phe Thr Phe Ser Tyr He 
355 360 365 

gee caa gaa tga 1116 
Ala Gin Glu * 
370 



<210> 36 
<211> 370 
<212> PRT 

<213> Adenovirus type 3 0 



<400> 36 



Ser 


Lys 


Arg 


Leu 


Arg 


Val 


Glu 


Asp Asp Phe Asn Pro 


Val 


Tyr 


Pro 


T Y r 


1 








5 






10 






15 


Gly 


Tyr 


Ala 


Arg 


Asn 


Gin 


Asn 


He Pro Phe Leu Thr 


Pro 


Pro 


Phe 


Val 








20 








25 




30 






Ser 


Ser 


Asp 


Gly 


Phe 


Lys 


Asn 


Phe Pro Pro Gly Val 


Leu 


Ser 


Leu 


Lys 






35 










40 


45 






Leu 


Ala 


Asp 


Pro 


He 


Ala 


He 


Thr Asn Gly Asp Val 


Ser 


Leu 


Lys 


Val 




50 










55 


60 








Gly 


Gly 


Gly 


Leu 


Thr 


Val 


Glu 


Gin Asp Ser Gly Asn 


Leu 


Ser 


Val 


Asn 


65 










70 




75 








80 


Pro 


Lys 


Ala 


Pro 


Leu 


Gin 


Val 


Gly Thr Asp Lys Lys 


Leu 


Glu 


Leu 


Ala 










85 






90 






95 




Leu 


Ala 


Pro 


Pro 


Phe 


Asp 


Val 


Arg Asp Asn Lys Leu 


Ala 


He 


Leu 


Val 








100 








105 




110 






Gly 


Asp 


Gly 


Leu 


Lys 


Val 


He 


Asp Arg Ser He Ser Asp 


Leu 


Pro 


Gly 






115 










120 


125 






Leu 


Leu 


Asn 


Tyr 


Leu 


Val 


Val 


Leu Thr Gly Lys Gly 


He 


Gly Asn Glu 




130 










135 


14 0 










Glu 


Leu 


Lys 


Asn 


Asp 


Asp 


Gly 


Ser Asn Lys Gly Val 


Gly 


Leu 


Cys 


Val 


145 










150 




155 






160 


Arg 


He 


Gly 


Glu 


Gly 


Gly 


Gly 


Leu Thr Phe Asp Asp 


Lys 


Gly Tyr 


Leu 
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165 170 175 

Val Ala Trp Asn Asn Lys His Asp lie Arg Thr Leu Trp Thr Thr Leu 

180 185 190 

Asp Pro Ser Pro Asn Cys Lys lie Asp lie Glu Lys Asp Ser Lys Leu 

195 200 205 

Thr Leu Val Leu Thr Lys Cys Gly Ser Gin lie Leu Ala Asn Val Ser 

210 215 .220 

Leu lie lie Val Asn Gly Lys Phe Lys lie Leu Asn Asn Lys Thr Asp 
225 230 235 240 

Pro Ser Leu Pro Lys Ser Phe Asn lie Lys Leu Leu Phe Asp Gin Asn 

245 250 255 

Gly Val Leu Leu Glu Asn Ser Asn lie Glu Lys Gin Tyr Leu Asn Phe 

260 265 270 

Arg Ser Gly Asp Ser lie Leu Pro Glu Pro Tyr Lys Asn Ala lie Gly 

275 280 285 

Phe Met Pro Asn Leu Leu Ala Tyr Ala Lys Ala Thr Thr Asp Gin Ser 

290 295 300 

Lys lie Tyr Ala Arg Asn Thr lie Tyr Gly Asn lie Tyr Leu Asp Asn 
305 310 315 320 

Gin Pro Tyr Asn Pro Val Val lie Lys lie Thr Phe Asn Asn Glu Ala 

325 330 335 

Asp Ser Ala Tyr Ser lie Thr Phe Asn Tyr Ser Trp Thr Lys Asp Tyr 

340 345 350 

Asp Asn lie Pro Phe Asp Ser Thr Ser Phe Thr Phe Ser Tyr lie Ala 
355 360 365 

Gin Glu 
370 



<210> 37 
<211> 1062 
<212> DNA 

<213> Adenovirus type 16 

<220> 
<221> CDS 

<222> (1) . . . (1062) 



<400> 37 

atg gcc aaa cga get egg eta age age tec ttc aat ccg gtc tac ccc 
Met Ala Lys Arg Ala Arg Leu Ser Ser Ser Phe Asn Pro Val Tyr Pro 
1 5 10 15 

tat gaa gat gaa age age tea caa cac ccc ttt ata aac cct ggt ttc 
Tyr Glu Asp Glu Ser Ser Ser Gin His Pro Phe lie Asn Pro Gly Phe 
20 25 30 



aaa tgt gtt aat cca etc act acc gcc age gga ccc etc caa ctt aaa 

Lys Cys Val Asn Pro Leu Thr Thr Ala Ser Gly Pro Leu Gin Leu Lys 

• 50 55 60 

gtt gga age agt ctt aca gta gat act ate gat ggg tct ttg gag gaa 

Val Gly Ser Ser Leu Thr Val Asp Thr lie Asp Gly Ser Leu Glu Glu 

65 70 75 80 



48 



96 



att tec tea aat ggt ttt gca caa age cca gat gga gtt eta act ctt 144 
lie Ser Ser Asn Gly Phe Ala Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 40 45 



192 



240 



aat ata act gcc gca gcg cca etc act aaa act aac cac tec ata ggt 2 88 
Asn He Thr Ala Ala Ala Pro Leu Thr Lys Thr Asn His Ser He Gly 
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85 90 95 

tta tta ata gga tct ggc ttg caa aca aag gat gat aaa ctt tgt tta 33 6 
Leu Leu lie Gly Ser Gly Leu Gin Thr Lys Asp Asp Lys Leu Cys Leu 
100 105 110 

teg ctg gga gat ggg ttg gta aca aag gat gat aaa eta tgt tta teg 3 84 
Ser Leu Gly Asp Gly Leu Val Thr Lys Asp Asp Lys Leu Cys Leu Ser 
115 120 125 

ct g gga gat ggg tta ata aca aaa aat gat gta eta tgt gee aaa eta 432 
Leu Gly Asp Gly Leu lie Thr Lys Asn Asp Val Leu Cys Ala Lys Leu 
130 ~ 135 140 

gga cat ggc ctt gtg ttt gac tct tec aat get ate acc ata gaa aac 4 80 
Gly His Gly Leu Val Phe Asp Ser Ser Asn Ala lie Thr He Glu Asn 
145 150 155 160 

aac acc ttg tgg aca ggc gca aaa cca age gec aac tgt gta att aaa 52 8 
Asn Thr Leu Trp Thr Gly Ala Lys Pro Ser Ala Asn Cys Val He Lys 
165 170 175 

9 a 9 99 a 9 aa 9 at tcc cca 9 ac fc 9 fc aa 9 ctc act tta 9 tt: cta 9 fc g aa g 576 
Glu Gly Glu Asp Ser Pro Asp Cys Lys Leu Thr Leu Val Leu Val Lys 
180 185 190 

aat gga gga ctg ata aat gga tac ata aca tta atg gga gec tea gaa 624 
Asn Gly Gly Leu lie Asn Gly Tyr He Thr Leu Met Gly Ala Ser Glu 
195 200 205 

tat act aac acc ttg ttt aaa aac aat caa gtt aca ate gat gta aac 672 
Tyr Thr Asn Thr Leu Phe Lys Asn Asn Gin Val Thr He Asp Val Asn 
210 215 220 

ctc gca ttt gat aat act ggc caa att att act tac eta tea tcc ctt 720 
Leu Ala Phe Asp Asn Thr Gly Gin He He Thr Tyr Leu Ser Ser Leu 
225 ~ 230 235 240 

aaa agt aac ctg aac ttt aaa gac aac caa aac atg get act gga acc 7 68 
Lys Ser Asn Leu Asn Phe Lys Asp Asn Gin Asn Met Ala Thr Gly Thr 
245 250 255 

ata acc agt gec aaa ggc ttc atg ccc age acc acc gee tat cca ttt 816 
He Thr Ser Ala Lys Gly Phe Met Pro Ser Thr Thr Ala Tyr Pro Phe 
260 265 270 

ata aca tac gee act gag acc eta aat gaa gat tac att tat gga gag 864 
He Thr Tyr Ala Thr Glu Thr Leu Asn Glu Asp Tyr He Tyr Gly Glu 
275 280 285 

tgt tac tac aaa tct acc aat gga act ctc ttt cca eta aaa gtt act 912 
Cys Tyr Tyr Lys Ser Thr Asn Gly Thr Leu Phe Pro Leu Lys Val Thr 
290 ~ 295 300 

gtc aca eta aac aga cgt atg tta get tct gga atg gec tat get atg 960 
Val Thr Leu Asn Arg Arg Met Leu Ala Ser Gly Met Ala Tyr Ala Met 
305 ~ 310 315 320 

aat ttt tea tgg tct eta aat gca gag gaa gec ccg gaa act acc gaa 10 0 8 
Asn Phe Ser Trp Ser Leu Asn Ala Glu Glu Ala Pro Glu Thr Thr Glu 
325 330 335 
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gtc act etc att acc tec ccc ttc ttt ttt tct tat ate aga gaa gat 1056 
Val Thr Leu lie Thr Ser Pro Phe Phe Phe Ser Tyr lie Arg Glu Asp 
340 345 350 

gac tga 1062 
Asp * 



<210> 38 
<211> 353 
<212> PRT 

<213> Adenovirus type 16 
<400> 38 

Met Ala Lys Arg Ala Arg Leu Ser Ser Ser Phe Asn Pro Val Tyr Pro 

1 " " 5 10 15 

Tyr Glu Asp Glu Ser Ser Ser Gin His Pro Phe lie Asn Pro Gly Phe 

20 25 30 

He Ser Ser Asn Gly Phe Ala Gin Ser Pro Asp Gly Val Leu Thr Leu 

35 " 40 45 

Lys Cys Val Asn Pro Leu Thr Thr Ala Ser Gly Pro Leu Gin Leu Lys 

50 55 60 

Val Gly Ser Ser Leu Thr Val Asp Thr He Asp Gly Ser Leu Glu Glu 
65 ~ 70 75 80 

Asn He Thr Ala Ala Ala Pro Leu Thr Lys Thr Asn His Ser He Gly 

85 90 95 

Leu Leu He Gly Ser Gly Leu Gin Thr Lys Asp Asp Lys Leu Cys Leu 

100 ' 105 110 

Ser Leu Gly Asp Gly Leu Val Thr Lys Asp Asp Lys Leu Cys Leu Ser 

115 ~ 12 0 12 5 

Leu Gly Asp Gly Leu He Thr Lys Asn Asp Val Leu Cys Ala Lys Leu 

130 ~ " 135 140 

Gly His Gly Leu Val Phe Asp Ser Ser Asn Ala He Thr He Glu Asn 
145 150 ~ 155 160 

Asn Thr Leu Trp Thr Gly Ala Lys Pro Ser Ala Asn Cys Val He Lys 

165 170 175 

Glu Gly Glu Asp Ser Pro Asp Cys Lys Leu Thr Leu Val Leu Val Lys 

180 185 190 

Asn Gly Gly Leu He Asn Gly Tyr He Thr Leu Met Gly Ala Ser Glu 

195 200 205 

Tyr Thr Asn Thr Leu Phe Lys Asn Asn Gin Val Thr He Asp Val Asn 

210 215 220 

Leu Ala Phe Asp Asn Thr Gly Gin He He Thr Tyr Leu Ser Ser Leu 
225 230 235 240 

Lys Ser Asn Leu Asn Phe Lys Asp Asn Gin Asn Met Ala Thr Gly Thr 

245 250 255 

He Thr Ser Ala Lys Gly Phe Met Pro Ser Thr Thr Ala Tyr Pro Phe 

260 265 270 

He Thr Tyr Ala Thr Glu Thr Leu Asn Glu Asp Tyr He Tyr Gly Glu 

275 280 285 

Cys Tyr Tyr Lys Ser Thr Asn Gly Thr Leu Phe Pro Leu Lys Val Thr 

290 295 . 300 

Val Thr Leu Asn Arg Arg Met Leu Ala Ser Gly Met Ala Tyr Ala Met 
305 310 315 320 

Asn Phe Ser Trp Ser Leu Asn Ala Glu Glu Ala Pro Glu Thr Thr Glu 

325 330 335 

Val Thr Leu He Thr Ser Pro Phe Phe Phe Ser Tyr He Arg Glu Asp 
340 345 350 

Asp 
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<210> 39 
<211> 972 
<212> DNA 

<213> Adenovirus type 3 5 

<220> 

<221> CDS 

<222> (1) . . . (972) 

<400> 39 

atg acc aag aga gtc egg etc agt gac tec ttc aac cct gtc tac ccc 4 8 
Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 
1 " 5 10 15 

tat gaa gat gaa age acc tec caa cac ccc ttt ata aac cca ggg ttt 9 6 
Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe lie Asn Pro Gly Phe 
20 25 30 

att tec cca aat ggc ttc aca caa age cca gac gga gtt ctt act tta 144 
lie Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 ^ 40 45 



aaa tgt tta acc cca eta aca acc aca ggc gga tct eta cag eta aaa 
Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 
50 55 60 

gtg gga ggg gga ctt aca gtg gat gac act gat ggt acc tta caa gaa 
Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
65 70 75 8 0 

aac ata cgt get aca gca ccc att act aaa aat aat cac tct gta gaa 
Asn lie Arg Ala Thr Ala Pro lie Thr Lys Asn Asn His Ser Val Glu 
85 90 95 



aaa ttg gga aat ggg tta aaa ttt aac aac ggt gac att tgt ata aag 

Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp lie Cys lie Lys 

115 ^ 120 125 

gat agt att aac acc tta tgg act gga ata aac cct cca cct aac tgt 

Asp Ser lie Asn Thr Leu Trp Thr Gly lie Asn Pro Pro Pro Asn Cys 
130 135 140 

caa att gtg gaa aac act aat aca aat gat ggc aaa ctt act tta gta 

Gin lie Val Glu Asn Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val 

145 150 155 160 



192 



240 



288 



eta tec att gga aat gga tta gaa act caa aac aat aaa eta tgt gee 336 
Leu Ser lie Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 
100 ~ 105 110 



384 



432 



480 



tta gta aaa aat gga ggg ctt gtt aat ggc tac gtg tct eta gtt ggt 52 8 
Leu Val Lys Asn Gly Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly 
165 ~ 170 175 

gta tea gac act gtg aac caa atg ttc aca caa aag aca gca aac ate 576 
Val Ser Asp Thr Val Asn Gin Met Phe Thr Gin Lys Thr Ala Asn He 
180 185 190 
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caa tta aga tta tat ttt gac tct tct gga aat eta tta act gag gaa 
Gin Leu Arg Leu Tyr Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu 
19 5 200 205 

tea gac tta aaa att cca ctt aaa aat aaa tct tct aca gcg acc agt 
Ser Asp Leu Lys lie Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser 
210 215 220 

gaa act gta gec age age aaa gee ttt atg cca agt act aca get tat 
Glu Thr val Ala Ser Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr 
225 230 235 240 



624 



672 



720 



ccc ttc aac acc act act agg gat agt gaa aac tac att cat gga ata 768 
Pro Phe Asn Thr Thr Thr Arg Asp Ser Glu Asn Tyr He His Gly lie 
245 250 255 

tgt tac tac atg act agt tat gat aga agt eta ttt ccc ttg aac att 816 
Cys Tyr Tyr Met Thr Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn lie 
260 265 270 

tct ata atg eta aac age cgt atg att tct tec aat gtt gee tat gee 864 
Ser lie Met Leu Asn Ser Arg Met lie Ser Ser Asn Val Ala Tyr Ala 
275 280 285 

ata caa ttt gaa tgg aat eta aat gca agt gaa tct cca gaa age aac 912 
lie Gin Phe Glu Trp Asn Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn 
290 295 300 

ata get acg ctg acc aca tec ccc ttt ttc ttt tct tac att aca gaa 960 
lie Ala Thr Leu Thr Thr Ser Pro Phe Phe Phe Ser Tyr lie Thr Glu 
305 310 315 320 

gac gac aac taa 972 
Asp Asp Asn * 



<210> 40 
<211> 323 
<212> PRT 

<213> Adenovirus type 35 
<400> 40 



Met 


Thr 




Arg 


Val 


Arg 


Leu 


Ser 


Asp 


Ser Phe Asn Pro 


Val 


Tyr 


Pro 


1 








5 










10 




15 




Tyr 


Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Gin 


His 


Pro Phe He Asn 


Pro 


Gly 


Phe 








20 










25 




30 




lie 


Ser 


Pro 


Asn 


Gly 


Phe 


Thr 


Gin 


Ser 


Pro Asp Gly Val 


Leu 


Thr 


Leu 






35 










40 




45 








Lys 


Cys 


Leu 


Thr 


Pro 


Leu 


Thr Thr Thr Gly Gly Ser Leu 


Gin 


Leu 


Lys 




50 










55 






60 






Val 


Gly 


Gly 


Gly 


Leu 


Thr 


Val 


Asp 


Asp 


Thr Asp Gly Thr 


Leu 


Gin 


Glu 


65 










70 








75 






80 


Asn 


He 


Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr 


Lys Asn Asn His 


Ser 


Val 


Glu 










85 










90 




95 




Leu 


Ser 


He 


Gly 


Asn 


Gly 


Leu 


Glu 


Thr 


Gin Asn Asn Lys 


Leu 


Cys 


Ala 








100 










105 


110 




Lys 


Leu 


Gly 


Asn 


Gly 


Leu 


Lys 


Phe 


Asn 


Asn Gly Asp He 


Cys 


He 


Lys 






115 










120 




125 




Asp 


Ser 


He 


Asn 


Thr 


Leu 


Trp 


Thr 


Gly 


He Asn Pro Pro 


Pro 


Asn 


Cys 




130 










135 




140 
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Gin 


lie 


Val 


Glu 


Asn 


Thr 


Asn 


Thr 


Asn Asp 


Gly 


Lys 


Leu 


Thr 


Leu 


Val 


145 










150 








155 










160 


Leu 


Val 


Lys 


Asn 


Gly 


Gly 


Leu 


Val 


Asn Gly 


Tyr 


Val 


Ser 


Leu 


Val 


Gly 










165 








170 










175 




Val 


Ser 


As P 


Thr 


Val 


Asn 


Gin 


Met 


Phe Thr 


Gin 


Lys 


Thr 


Ala 


Asn 


He 






180 










185 








190 






Gin 


Leu 


Arg 


Leu 


Tyr 


Phe 


Asp 


Ser 


Ser Gly Asn 


Leu 


Leu 


Thr 


Glu 


Glu 






195 










200 








205 








Ser 


Asp 


Leu 


Lys 


lie 


Pro 


Leu 


Lys 


Asn Lys 


Ser 


Ser 


Thr 


7\ 1 


Tnr 


Q d 




210 










215 








220 










Glu 


Thr 


Val 


Ala 


Ser 


Ser 


Lys 


Ala 


Phe Met 


Pro 


Ser 


Thr 


Thr 


Ala 


Tyr 


225 










230 








235 










240 


Pro 


Phe 


Asn 


Thr 


Thr 


Thr 


Arg 


Asp 


Ser Glu 


Asn 


Tyr 


He 


His 


Gly 


He 










245 








250 










255 




Cys 


Tyr 


Tyr 


Met 


Thr 


Ser 


Tyr 


Asp 


Arg Ser 


Leu 


Phe 


Pro 


Leu 


Asn 


He 






260 










265 








270 






Ser 


He 


Met 


Leu 


Asn 


Ser 


Arg 


Met 


He Ser 


Ser 


Asn 


Val 


Ala 


Tyr 


Ala 






275 








280 








285 








lie 


Gin 


Phe 


Glu 


Trp 


Asn 


Leu 


Asn 


Ala Ser 


Glu 


Ser 


Pro 


Glu 


Ser 


Asn 




290 








295 








300 










lie 


Ala 


Thr 


Leu 


Thr 


Thr 


Ser 


Pro 


Phe Phe 


Phe 


Ser 


Tyr 


He 


Thr 


Glu 


305 










310 








315 










320 


Asp 


Asp 


Asn 



























<210> 41 
<211> 1062 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5/Adl6 chimeric fiber 



48 



96 



<221> CDS 

<222> (1) . . . (1062) 
<400> 41 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
1 ~ 5 10 15 

tat gaa gat gaa age age tea caa cac ccc ttt ata aac cct ggt ttc 
Tyr Glu Asp Glu Ser Ser Ser Gin His Pro Phe He Asn Pro Gly Phe 
20 25 30 

att tec tea aat ggt ttt gca caa age cca gat gga gtt eta act ctt 144 
He Ser Ser Asn Gly Phe Ala Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 40 45 

aaa tgt gtt aat cca etc act acc gec age gga ccc etc caa ctt aaa 
Lys Cys Val Asn Pro Leu Thr Thr Ala Ser Gly Pro Leu Gin Leu Lys 
50 55 60 

gtt gga age agt ctt aca gta gat act ate gat ggg tct ttg gag gaa 
Val Gly Ser Ser Leu Thr Val Asp Thr He Asp Gly Ser Leu Glu Glu 
65 70 75 80 

aat ata act gec gca gcg cca etc act aaa act aac cac tec ata ggt 
Asn He Thr Ala Ala Ala Pro Leu Thr Lys Thr Asn His Ser He Gly 
85 90 95 



192 



240 



288 
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tta tta ata gga tct ggc ttg caa aca aag gat gat aaa ctt tgt tta 33 6 
Leu Leu lie Gly Ser Gly Leu Gin Thr Lys Asp Asp Lys Leu Cys Leu 
100 105 110 

teg ctg gga gat ggg ttg gta aca aag gat gat aaa eta tgt tta teg 3 84 
Ser Leu Gly Asp Gly Leu Val Thr Lys Asp Asp Lys Leu Cys Leu Ser 
115 120 125 

ctg gga gat ggg tta ata aca aaa aat gat gta eta tgt gee aaa eta 432 
Leu Gly Asp Gly Leu lie Thr Lys Asn Asp Val Leu Cys Ala Lys Leu 
130 135 140 

gga cat ggc ctt gtg ttt gac tct tec aat get ate ace ata gaa aac 480 
Gly His Gly Leu Val Phe Asp Ser Ser Asn Ala He Thr He Glu Asn 
145 150 155 160 

aac ace ttg tgg aca ggc gca aaa cca age gee aac tgt gta att aaa 528 
Asn Thr Leu Trp Thr Gly Ala Lys Pro Ser Ala Asn Cys Val He Lys 
165 170 " 175 

gag gga gaa gat tec cca gac tgt aag etc act tta gtt eta gtg aag 576 
Glu Gly Glu Asp Ser Pro Asp Cys Lys Leu Thr Leu Val Leu Val Lys 
180 185 ' 190 

aat gga gga ctg ata aat gga tac ata aca tta atg gga gee tea gaa 624 
Asn Gly Gly Leu He Asn Gly Tyr He Thr Leu Met Gly Ala Ser Glu 
195 200 205 

tat act aac acc ttg ttt aaa aac aat caa gtt aca ate gat gta aac 672 
Tyr Thr Asn Thr Leu Phe Lys Asn Asn Gin Val Thr He Asp Val Asn 
210 215 220 

etc gca ttt gat aat act ggc caa att att act tac eta tea tec ctt 720 
Leu Ala Phe Asp Asn Thr Gly Gin He He Thr Tyr Leu Ser Ser Leu 
225 230 235 240 

aaa agt aac ctg aac ttt aaa gac aac caa aac atg get act gga acc 768 
Lys Ser Asn Leu Asn Phe Lys Asp Asn Gin Asn Met Ala Thr Gly Thr 
245 250 255 

ata acc agt gee aaa ggc ttc atg ccc age acc acc gee tat cca ttt 816 
He Thr Ser Ala Lys Gly Phe Met Pro Ser Thr Thr Ala Tyr Pro Phe 
260 265 270 

ata aca tac gee act gag acc eta aat gaa gat tac att tat gga gag 864 
He Thr Tyr Ala Thr Glu Thr Leu Asn Glu Asp Tyr He Tyr Gly Glu 
275 280 285 

tgt tac tac aaa tct acc aat gga act etc ttt cca eta aaa gtt act 912 
Cys Tyr Tyr Lys Ser Thr Asn Gly Thr Leu Phe Pro Leu Lys Val Thr 
290 295 300 

gtc aca eta aac aga cgt atg tta get tct gga atg gee tat get atg 960 
Val Thr Leu Asn Arg Arg Met Leu Ala Ser Gly Met Ala Tyr Ala Met 
305 310 315 320 

aat ttt tea tgg tct eta aat gca gag gaa gee ccg gaa act acc gaa 1008 
Asn Phe Ser Trp Ser Leu Asn Ala Glu Glu Ala Pro Glu Thr Thr Glu 
325 330 335 
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gtc act etc att acc tec ccc ttc ttt ttt tct tat ate aga gaa gat 1056 
Val Thr Leu lie Thr Ser Pro Phe Phe Phe Ser Tyr lie Arg Glu Asp 
340 345 " 350 

gac tga 10 62 
Asp * 



<210> 42 
<211> 353 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad5/Adl6 chimeric fiber 



<400> 42 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


1 








5 








Tyr 


Glu 


Asp 


Glu 


Ser 


Ser 


Ser 


Gin 








20 










lie 


Ser 


Ser 


Asn 


Gly 


Phe 


Ala 


Gin 






35 










40 


Lys 


Cys 


Val 


Asn 


Pro 


Leu 


Thr 


Thr 




50 










55 




Val 


Gly 


Ser 


Ser 


Leu 


Thr 


Val 


Asp 


65 










70 






Asn 


He 


Thr 


Ala 


Ala 


Ala 


Pro 


Leu 










85 








Leu 


Leu 


He 


Gly 


Ser 


Gly 


Leu 


Gin 








100 










Ser 


Leu 


Gly 


Asp 


Gly 


Leu 


Val 


Thr 






115 










120 


Leu 


Gly 


Asp 


Gly 


Leu 


He 


Thr 


Lys 




130 










135 




Gly His 


Gly 


Leu 


Val 


Phe 


Asp 


Ser 


145 










150 






Asn 


Thr 


Leu 


Trp 


Thr 


Gly 


Ala 


Lys 










165 








Glu 


Gly 


Glu 


Asp 


Ser 


Pro 


Asp 


Cys 








180 










Asn 


Gly 


Gly 


Leu 


He 


Asn 


Gly 


Tyr 






195 










200 


Tyr 


Thr 


Asn 


Thr 


Leu 


Phe 


Lys 


Asn 




210 










215 




Leu 


Ala 


Phe 


Asp 


Asn 


Thr 


Gly 


Gin 


225 










230 






Lys 


Ser 


Asn 


Leu 


Asn 


Phe 


Lys 


Asp 










245 








lie 


Thr 


Ser 


Ala 


Lys 


Gly 


Phe 


Met 








260 








lie 


Thr 


Tyr 


Ala 


Thr 


Glu 


Thr 


Leu 






275 










280 


Cys 


Tyr 


Tyr 


Lys 


Ser 


Thr 


Asn 


Gly 




290 










295 




Val 


Thr 


Leu 


Asn 


Arg 


Arg 


Met 


Leu 


305 










310 






Asn 


Phe 


Ser 


Trp 


Ser 


Leu 


Asn 


Ala 








325 








Val 


Thr 


Leu 


He 


Thr 


Ser 


Pro 


Phe 



Asp 


Thr 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 




10 










15 




His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 


25 










30 




Ser 


Pro 


Asp 


Gly Val 


Leu 


Thr 


Leu 










45 








Ala 


Ser 


Gly Pro 


Leu 


Gin 


Leu 


L Y S 








60 










Thr 


He 


Asp 


Gly 


Ser 


Leu 


Glu 


Glu 






75 










80 


Thr 


Lys 


Thr 


Asn 


His 


Ser 


He 


Gly 




90 










95 




Thr 


Lys 


Asp 


Asp 


Lys 


Leu 


Cys 


Leu 


105 










110 






Lys 


Asp 


Asp 


Lys 


Leu 


Cys 


Leu 


Ser 










125 








Asn 


Asp 


Val 


Leu 


Cys 


Ala 


Lys 


Leu 








140 










Ser 


Asn 


Ala 


He 


Thr 


He 


Glu 


Asn 






155 










160 


Pro 


Ser 


Ala 


Asn 


Cys 


Val 


He 


Lys 




170 










175 




Lys 


Leu 


Thr 


Leu 


Val 


Leu 


Val 


Lys 


185 










190 






He 


Thr 


Leu 


Met 


Gly Ala 


Ser 


Glu 










205 








Asn 


Gin 


Val 


Thr 


He 


Asp 


Val 


Asn 








220 










He 


He 


Thr 


Tyr 


Leu 


Ser 


Ser 


Leu 






235 










240 


Asn 


Gin 


Asn 


Met 


Ala 


Thr 


Gly 


Thr 




250 










255 




Pro 


Ser 


Thr 


Thr 


Ala 


Tyr 


Pro 


Phe 


265 










270 






Asn 


Glu 


Asp 


Tyr 


He 


Tyr 


Gly Glu 










285 








Thr 


Leu 


Phe 


Pro 


Leu 


Lys 


Val 


Thr 








300 










Ala 


Ser 


Gly Met 


Ala 


Tyr 


Ala 


Met 






315 










320 


Glu 


Glu 


Ala 


Pro 


Glu 


Thr 


Thr 


Glu 




330 










335 




Phe 


Phe 


Ser 


Tyr 


He 


Arg 


Glu 


Asp 
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340 345 350 

Asp 



<210> 43 
<211> 972 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5/Ad35 chimeric fiber 

<221> CDS 

<222> (1) . . . (972) 

<400> 43 

atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat cca 48 
Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
15 10 15 

tat gaa gat gaa age acc tec caa cac ccc ttt ata aac cca ggg ttt 96 
Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe lie Asn Pro Gly Phe 
2 0 2 5 3 0 

att tec cca aat ggc ttc aca caa age cca gac gga gtt ctt act tta 144 
lie Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val Leu Thr Leu 
35 " 40 45 

aaa tgt tta acc cca eta aca acc aca ggc gga tct eta cag eta aaa 192 
Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 
50 55 60 

gtg gga ggg gga ctt aca gtg gat gac act gat ggt acc tta caa gaa 240 
Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
65 70 75 80 

aac ata cgt get aca gca ccc att act aaa aat aat cac tct gta gaa 2 88 
Asn He Arg Ala Thr Ala Pro He Thr Lys Asn Asn His Ser Val Glu 
85 90 95 

eta tec att gga aat gga tta gaa act caa aac aat aaa eta tgt gec 3 36 
Leu Ser He Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 
100 ~ 105 110 

aaa ttg gga aat ggg tta aaa ttt aac aac ggt gac att tgt ata aag 3 84 
Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp He Cys He Lys 
115 " 120 125 

gat agt att aac acc tta tgg act gga ata aac cct cca cct aac tgt 43 2 
Asp Ser He Asn Thr Leu Trp Thr Gly lie Asn Pro Pro Pro Asn Cys 
130 135 140 

caa att gtg gaa aac act aat aca aat gat ggc aaa ctt act tta gta 480 
Gin He Val Glu Asn Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val 
145 150 155 160 

tta gta aaa aat gga ggg ctt gtt aat ggc tac gtg tct eta gtt ggt 52 8 
Leu Val Lys Asn Gly Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly 
165 170 175 
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gta tea gac act gtg aac caa atg ttc aca caa aag aca gca aac ate 576 
Val Ser Asp Thr Val Asn Gin Met Phe Thr Gin Lys Thr Ala Asn lie 
180 185 190 

caa tta aga tta tat ttt gac tct tct gga aat eta tta act gag gaa 624 
Gin Leu Arg Leu Tyr Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu 
195 200 205 

tea gac tta aaa att cca ctt aaa aat aaa tct tct aca gcg acc agt 672 
Ser Asp Leu Lys lie Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser 
210 215 220 

gaa act gta gec age age aaa gee ttt atg cca agt act aca get tat 720 
Glu Thr Val Ala Ser Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr 
225 230 235 240 

ccc ttc aac acc act act agg gat agt gaa aac tac att cat gga ata 768 
Pro Phe Asn Thr Thr Thr Arg Asp Ser Glu Asn Tyr lie His Gly He 
245 250 255 

tgt tac tac atg act agt tat gat aga agt eta ttt ccc ttg aac att 816 
Cys Tyr Tyr Met Thr Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn He 
260 ~ 265 270 

tct ata atg eta aac age cgt atg att tct tec aat gtt gee tat gee 864 
Ser He Met Leu Asn Ser Arg Met He Ser Ser Asn Val Ala Tyr Ala 
275 280 285 

ata caa ttt gaa tgg aat eta aat gca agt gaa tct cca gaa age aac 912 
He Gin Phe Glu Trp Asn Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn 
290 295 300 

ata get acg ctg acc aca tec ccc ttt ttc ttt tct tac att aca gaa 960 
He Ala Thr Leu Thr Thr Ser Pro Phe Phe Phe Ser Tyr He Thr Glu 
305 310 315 320 

gac gac aac taa 972 
Asp Asp Asn * 



<210> 44 

<211> 323 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad5/Ad35 chimeric fiber 



<400> 44 



Met 


Lys Arg 


Ala Arg 


Pro 


Ser 


Glu 


1 




5 








Tyr Glu Asp 


Glu Ser 


Thr 


Ser 


Gin 






20 








He 


Ser Pro 


Asn Gly 


Phe 


Thr 


Gin 




35 








40 


Lys 


Cys Leu 


Thr Pro 


Leu 


Thr 


Thr 


50 






55 




Val 


Gly Gly 


Gly Leu 


Thr 


Val 


Asp 


65 






70 






Asn 


He Arg 


Ala Thr 


Ala 


Pro 


He 



Asp 


Thr Phe 


Asn Pro 


Val 


Tyr 


Pro 




10 






15 




His 


Pro Phe 


lie Asn 


Pro 


Gly 


Phe 


25 






30 






Ser 


Pro Asp 


Gly Val 


Leu 


Thr 


Leu 






45 








Thr Gly Gly 


Ser Leu 


Gin 


Leu 


Lys 






60 








Asp 


Thr Asp 


Gly Thr Leu Gin 


Glu 




75 








80 


Thr 


Lys Asn 


Asn His 


Ser 


Val 


Glu 
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Leu 


Ser 


He 


Gly 








100 


Lys 


Leu 


Gly 


Asn 






115 




Asp 


Ser 


He 


Asn 




130 






Gin 


He 


Val 


Glu 


145 








Leu 


Val 


Lys 


Asn 


Val 


Ser 


Asp 


Thr 








180 


Gin 


Leu 


Arg 


Leu 






195 




Ser 


Asp 


Leu 


Lys 




210 






Glu 


Thr 


Val 


Ala 


225 








Pro 


Phe 


Asn 


Thr 


Cys 


Tyr 


Tyr 


Met 








260 


Ser 


He 


Met 


Leu 






275 




He 


Gin 


Phe 


Glu 




290 






He 


Ala 


Thr 


Leu 


305 








Asp 


Asp 


Asn 





85 

Asn Gly Leu Glu 

Gly Leu Lys Phe 
120 

Thr Leu Trp Thr 
135 

Asn Thr Asn Thr 
150 

Gly Gly Leu Val 
165 

Val Asn Gin Met 

Tyr Phe Asp Ser 
200 

He Pro Leu Lys 
215 

Ser Ser Lys Ala 
23 0 

Thr Thr Arg Asp 
245 

Thr Ser Tyr Asp 

Asn Ser Arg Met 
280 

Trp Asn Leu Asn 
295 

Thr Thr Ser Pro 
310 



90 

Thr Gin Asn Asn 
105 

Asn Asn Gly Asp 

Gly He Asn Pro 
140 

Asn Asp Gly Lys 
155 

Asn Gly Tyr Val 
170 

Phe Thr Gin Lys 
185 

Ser Gly Asn Leu 

Asn Lys Ser Ser 
220 

Phe Met Pro Ser 
235 

Ser Glu Asn Tyr 
250 

Arg Ser Leu Phe 
265 

He Ser Ser Asn 

Ala Ser Glu Ser 
300 

Phe Phe Phe Ser 
315 



95 

Lys Leu Cys Ala 
110 

He Cys He Lys 
125 

Pro Pro Asn Cys 

Leu Thr Leu Val 
160 

Ser Leu Val Gly 
175 

Thr Ala Asn He 
190 

Leu Thr Glu Glu 
205 

Thr Ala Thr Ser 

Thr Thr Ala Tyr 
240 

He His Gly He 
255 

Pro Leu Asn He 
270 

Val Ala Tyr Ala 
285 

Pro Glu Ser Asn 

Tyr He Thr Glu 
320 



<210> 45 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> HSP binding motif 

<400> 45 
Lys Lys Thr Lys 
1 



<210> 46 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> conserved sequence 

<400> 46 
Thr Leu Trp Thr 
1 



<210> 47 
<211> 7607 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GRE5-El-SV4 0-Hygro 
<400> 47 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 12 0 
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 18 0 
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 24 0 
gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300 
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 3 60 
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 42 0 
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 60 0 
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 72 0 
ggaaatattc ttattggtag aaacaactac atcctggtca tcatcctgcc tttctcttta 780 
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 84 0 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900 
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 960 
actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 102 0 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 10 80 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 114 0 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 12 0 0 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 13 2 0 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 144 0 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 150 0 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 162 0 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 174 0 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 192 0 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 198 0 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2 04 0 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 210 0 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 222 0 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
ggggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 2 52 0 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2 58 0 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2 64 0 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2 70 0 
gcattttgac aattacagag gatgggcagg ggctaaaggg ggtaaagagg gagcgggggg 2 760 
cttgtgaggc tacagaggag gctaggaatc tagcttttag cttaatgacc agacaccgtc 2 82 0 
ctgagtgtat tacttttcaa cagatcaagg ataattgcgc taatgagctt gatctgctgg 2880 
cgcagaagta ttccatagag cagctgacca cttactggct gcagccaggg gatgattttg 2 940 
aggaggctat tagggtatat gcaaaggtgg cacttaggcc agattgcaag tacaagatca 3 000 
gcaaacttgt aaatatcagg aattgttgct acatttctgg gaacggggcc gaggtggaga 3 060 
tagatacgga ggatagggtg gcctttagat gtagcatgat aaatatgtgg ccgggggtgc 312 0 
ttggcatgga cggggtggtt attatgaatg taaggtttac tggccccaat tttagcggta 3180 
cggttttcct ggccaatacc aaccttatcc tacacggtgt aagcttctat gggtttaaca 3240 
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atacctgtgt ggaagcctgg accgatgtaa 
ggaagggggt ggtgtgtcgc cccaaaagca 
ggtgtacctt gggtatcctg tctgagggta 
actgtggttg cttcatgcta gtgaaaagcg 
gcaactgcga ggacagggcc tctcagatgc 
tgaagaccat tcacgtagcc agccactctc 
tactgacccg ctgttccttg catttgggta 
gcaatttgag tcacactaag atattgcttg 
acggggtgtt tgacatgacc atgaagatct 
ccaggtgcag accctgcgag tgtggcggta 
atgtgaccga ggagctgagg cccgatcact 
gctctagcga tgaagataca gattgaggta 
gaaagaatat ataaggtggg ggtcttatgt 
ccgccatgag caccaactcg tttgatggaa 
tgcccccatg ggccggggtg cgtcagaatg 
tcctgcccgc aaactctact accttgacct 
ctgcagcctc cgccgccgct tcagccgctg 
ttgctttcct gagcccgctt gcaagcagtg 
agttgacggc tcttttggca caattggatt 
agcagctgtt ggatctgcgc cagcaggttt 
cggtttaaaa cataaataaa aaaccagact 
gctgtctcag ctgactgctt aagtcgcaag 
ttaaagcaga acttgtttat tgcagcttat 
aatttcacaa ataaagcatt tttttcactg 
aatgtatctt atcatgtctg gtcgactcta 
ctgcgctcgg tcgttcggct gcggcgagcg 
ttatccacag aatcagggga taacgcagga 
gccaggaacc gtaaaaaggc cgcgttgctg 
gagcatcaca aaaatcgacg ctcaagtcag 
taccaggcgt ttccccctgg aagctccctc 
accggatacc tgtccgcctt tctcccttcg 
tgtaggtatc tcagttcggt gtaggtcgtt 
cccgttcagc ccgaccgctg cgccttatcc 
agacacgact tatcgccact ggcagcagcc 
gtaggcggtg ctacagagtt cttgaagtgg 
gtatttggta tctgcgctct gctgaagcca 
tgatccggca aacaaaccac cgctggtagc 
acgcgcagaa aaaaaggatc tcaagaagat 
cagtggaacg aaaactcacg ttaagggatt 
acctagatcc ttttaaatta aaaatgaagt 
acttggtctg acagttacca atgcttaatc 
tttcgttcat ccatagttgc ctgactcccc 
ttaccatctg gccccagtgc tgcaatgata 
ttatcagcaa taaaccagcc agccggaagg 
tccgcctcca tccagtctat taattgttgc 
aatagtttgc gcaacgttgt tgccattgct 
ggtatggctt cattcagctc cggttcccaa 
ttgtgcaaaa aagcggttag ctccttcggt 
gcagtgttat cactcatggt tatggcagca 
gtaagatgct tttctgtgac tggtgagtac 
cggcgaccga gttgctcttg cccggcgtca 
actttaaaag tgctcatcat tggaaaacgt 
ccgctgttga gatccagttc gatgtaaccc 
tttactttca ccagcgtttc tgggtgagca 
ggaataaggg cgacacggaa atgttgaata 
agcatttatc agggttattg tctcatgagc 
aaacaaatag gggttccgcg cacatttccc 
attattatca tgacattaac ctataaaaat 
gcgtttcggt gatgacggtg aaaacctctg 
ttgtctgtaa gcggatgccg ggagcagaca 
cgggtgtcgg ggctggctta actatgcggc 



gggttcgggg ctgtgccttt tactgctgct 33 00 
gggcttcaat taagaaatgc ctctttgaaa 33 60 
actccagggt gcgccacaat gtggcctccg 342 0 
tggctgtgat taagcataac atggtatgtg 348 0 
tgacctgctc ggacggcaac tgtcacctgc 3 54 0 
gcaaggcctg gccagtgttt gagcataaca 360 0 
acaggagggg ggtgttccta ccttaccaat 3 66 0 
agcccgagag catgtccaag gtgaacctga 3 72 0 
ggaaggtgct gaggtacgat gagacccgca 3780 
aacatattag gaaccagcct gtgatgctgg 3 84 0 
tggtgctggc ctgcacccgc gctgagtttg 3 90 0 
ctgaaatgtg tgggcgtggc ttaagggtgg 3 96 0 
agttttgtat ctgttttgca gcagccgccg 4020 
gcattgtgag ctcatatttg acaacgcgca 4 08 0 
tgatgggctc cagcattgat ggtcgccccg 4140 
acgagaccgt gtctggaacg ccgttggaga 420 0 
cagccaccgc ccgcgggatt gtgactgact 4260 
cagcttcccg ttcatccgcc cgcgatgaca 432 0 
ctttgacccg ggaacttaat gtcgtttctc 43 8 0 
ctgccctgaa ggcttcctcc cctcccaatg 4440 
ctgtttggat ttggatcaag caagtgtctt 4500 
ccgaattgga tccaattcgg atcgatctta 4560 
aatggttaca aataaagcaa tagcatcaca 4 62 0 
cattctagtt gtggtttgtc caaactcatc 4680 
gactcttccg cttcctcgct cactgactcg 4740 
gtatcagctc actcaaaggc ggtaatacgg 4 80 0 
aagaacatgt gagcaaaagg ccagcaaaag 4860 
gcgtttttcc ataggctccg cccccctgac 4920 
aggtggcgaa acccgacagg actataaaga 49 80 
gtgcgctctc ctgttccgac cctgccgctt 5040 
ggaagcgtgg cgctttctca tagctcacgc 510 0 
cgctccaagc tgggctgtgt gcacgaaccc 5160 
ggtaactatc gtcttgagtc caacccggta 522 0 
actggtaaca ggattagcag agcgaggtat 52 80 
tggcctaact acggctacac tagaaggaca 53 4 0 
gttaccttcg gaaaaagagt tggtagctct 54 0 0 
ggtggttttt ttgtttgcaa gcagcagatt 54 60 
cctttgatct tttctacggg gtctgacgct 5520 
ttggtcatga gattatcaaa aaggatcttc 5580 
tttaaatcaa tctaaagtat atatgagtaa 5640 
agtgaggcac ctatctcagc gatctgtcta 5700 
gtcgtgtaga taactacgat acgggagggc 5760 
ccgcgagacc cacgctcacc ggctccagat 582 0 
gccgagcgca gaagtggtcc tgcaacttta 58 80 
cgggaagcta gagtaagtag ttcgccagtt 5 94 0 
acaggcatcg tggtgtcacg ctcgtcgttt 6000 
cgatcaaggc gagttacafcg atcccccatg 6060 
cctccgatcg ttgtcagaag taagttggcc 612 0 
ctgcataatt ctcttactgt catgccatcc 6180 
tcaaccaagt cattctgaga atagtgtatg 6240 
atacgggata ataccgcgcc acatagcaga 63 0 0 
tcttcggggc gaaaactctc aaggatctta 63 60 
actcgtgcac ccaactgatc ttcagcatct 642 0 
aaaacaggaa ggcaaaatgc cgcaaaaaag 64 8 0 
ctcatactct tcctttttca atattattga 6540 
ggatacatat ttgaatgtat ttagaaaaat 66 0 0 
cgaaaagtgc cacctgacgt ctaagaaacc 6660 
aggcgtatca cgaggcccct ttcgtctcgc 6720 
acacatgcag ctcccggaga cggtcacagc 6780 
agcccgtcag ggcgcgtcag cgggtgttgg 6840 
atcagagcag attgtactga gagtgcacca 6900 
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tatgcggtgt 
agcgttaata 
caataggccg 
agtgttgttc 

gggcgaaaaa 

tttttggggt 
agagcttgac 
gcgggcgcta 
gcgcttaatg 
gaagggcgat 
gcaaggcgat 
gccagtgaat 



gaaataccgc 
ttttgttaaa 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc 

gggcgctggc 

cgccgctaca 
cggtgcgggc 
taagttgggt 
tgtaatacga 



acagatgcgt 
attcgcgtta 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagcacta 
ggcgaacgtg 
aagtgtagcg 
gggcgcgtcc 
ctcttcgcta 
aacgccaggg 
ctcactatag 



<210> 48 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> MMTV-E2a-SV4 0-Neo 



aaggagaaaa 
aatttttgtt 
aaatcaaaag 
ctattaaaga 
ccactacgtg 
aatcggaacc 
gcgagaaagg 
gtcacgctgc 
cattcgccat 
ttacgccagc 
ttttcccagt 
ggcgaattaa 



taccgcatca 
aaatcagctc 
aatagaccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
gcgtaaccac 
tcaggctgcg 
tggcgaaagg 
cacgacgttg 
ttcgggg 



ggaaattgta 
attttttaac 
gatagggttg 
caacgtcaaa 
ctaatcaagt 
cccccgattt 
agcgaaagga 
cacacccgcc 
caactgttgg 

gggatgtgct 
taaaacgacg 



6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7607 



<400> 48 

gaattccgca 

tgaccattca 

gagtgtccta 

tctgcgcaca 

ggcatagctc 

ctcacccttg 

aactcgacct 

ttttgtcctt 

accaatccaa 

actattttta 

acctcaattg 

aaaaagagtg 

ccttacatct 

tgggataggt 

gaaagactcg 

atgaaacaac 

attttccata 

catggcatga 

agtaagtttt 

cttggtttgg 

ttggaattta 

attttttgag 

gccatcccgt 

cggcgaccct 

gtgacccttg 

ggctatcatc 

cgtccaagac 

gcgccctgcc 

ggtatcttgc 

acggggtaga 

ggtacgcgcg 

ggcacatcgc 

ttcaacagca 

acttgttcct 

gagcggtcct 

tcctcaagcg 

gcggtggtgg 

atctttttct 

acccccgagc 



ttgcagagat 
ccacattggt 
cacctagggg 
aacggatgag 
tgctttgcct 
actcttttaa 
tcctcctgag 
cagaaataga 
taggtagatt 
ctcaaattca 
aagaacaggt 
tttttgtcaa 
acagaccaac 
gggttacagt 
ccagagctag 
aggtacatga 
ccaaggaggg 
gttattatga 
tggttacaaa 
tatcaaaggt 
tccaaatctt 
taaacttgca 
ctccgctcgt 
caggtcggcc 
tctctatttc 
acaagagcgg 
cctcaaagat 
gcaaggccag 
agttttggaa 
agttgaggcg 
gtgagaatag 
tgcgctcttg 
cgtcgtctcc 
cgtttgcctc 
cgtcgtcttc 
ggggtgcctc 
cgaactcaga 
gcctatagga 
gcggacgcgg 



attgtattta 
gtgcacctcc 
agaagcagcc 
cccatcagac 

ggggctattg 

tagctcttct 
gcaaggacca 
aataagaatg 
attagttact 
gaagttagaa 
gcaaggacta 
aataggagac 
agatgccccc 
caatggctat 
acctccttgg 
ttatatttat 
gacagtggct 
atagccttta 
ctgttcttaa 
tctgatctga 
atgtaaatgc 
acagtcctaa 
cacttatcct 
gactgcggca 
tactatttgg 
aacggactca 
ttttggcact 
ctgcttgtcc 
aaagatgtga 
cgggttgggc 
gtggcgttcg 
caacgcgtcg 
cacatctagg 
tgcgttgtcc 
gcttacaaaa 
gacggggaag 
gggggcggtt 
gaaggaaatg 
tgcggcgcga 



agtgcctagc 
aagcttgggc 
aaggggttgt 
aaagacatat 
ggggaagttg 
gtgcaagatt 
cagccaactt 
cttgctaaaa 
atgttaagaa 
atgggaatag 
ttgaccacag 
aggtggtggc 
ttaccatata 
aaagtgttat 
tgtatgttgt 
ctaggaacag 
ggactaatag 
ttggcccaac 
aacgaggatg 
gctctgagtg 
ttatgtaaac 
cattcacctc 
tcactttcca 
gctggcgccc 
tgtttgtctt 
ccatagggac 
tcgttgagcg 
gctcggctgc 
taggtggcaa 
tcgcatgtgc 
taggcaaggc 
cagataatgg 
tagtcgccat 
tggtcttgct 
cctgggtcct 
gtggtaggcg 
aggctgtcct 
gccagtcggg 
cgtcccccaa 



tcgatacaat 
agaaatggtt 
ttcccaccaa 
tcattctctg 
cggttcgtgc 
acaatctaaa 
cctcttacaa 
attatatttt 
atgaatcatt 
aaaatagaaa 
gcctagaagt 
aaccagggac 
caggaagata 
atagatccct 
ctcaagaaga 
gaatgcactt 
aacattattc 
cttgcggttc 
tgagacaagt 
ttctattttc 
caagatataa 
ttgtgtgttt 
gagggtcccc 
gaacagggac 
gtattgtctc 
caagctagcg 
aggcgatatc 
ggttggcacg 
gcacctctgg 
cgttttcttg 
tgacatccgc 
cgcactggcg 
gcctttcgtc 
ttttatcctc 
gctcgataat 
cgttggcggc 
tcttctcgac 
aagaggagca 
ccatggagga 



aaacgccatt 
gaactcccga 
ggacgacccg 
ctgcaaactt 
tcgcagggct 
caattcggag 
gccgcatcga 
taccaataag 
atcttttagt 
gagacgctca 
aaaaaaggga 
ttatagggga 
tgacttaaat 
cccttttcgt 
aaaagacgac 
ttggggaaag 
tgcaaaaact 
ccagggctta 
ggtttcctga 
ctatgttctt 
aagagtgctg 
gtgtctgttc 
ccgcagaccc 
cctcggataa 
tttcttgtct 
cttctcgtcg 
aggtatgaca 
gcaggatagg 
cacggcaaat 
gcgtttgggg 
tatggcgagg 
ctgcagatgc 
cccccgcccg 
tgttggtact 
cacttcctcc 
atcggtggag 
tgactccatg 
gcgcgaaacc 
cgtgtcgtcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 
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ccgtccccgt 
gagtccgagg 
cggccatcga 
tctcccaagc 
gaagatgtgg 
aaaggaggta 
cggacgcaag 
ccgctgagtg 
atggacaagt 
gtggaagctc 
ctgaccttca 
ctgcagtcgt 
ctgcaccgct 
aataaggagc 
gagcagtcta 
tccaacaccg 
ggcaagtctt 
atcaaggctt 
atgccactac 
ctaccaaagt 
tccgacaaga 
aaccctgtgt 
tcggcgcccg 
accgagctgc 
aacgtgtccc 
gcgcagacgg 
cctttattga 
gtggcgctcc 
ctgtagagct 
gagttgggta 
cagttggcaa 
ccctgtacgc 
gcgctacggc 
gaaaagccaa 
atccagtgta 
tgtggagaaa 
ccaagatctt 
gttgatgttg 
acacggtttc 
acaaaacata 
ttacctcagg 
aattggacaa 
tataatgtgt 
ctgatgaatg 
aaatgccatc 
agaagagaaa 
atgctgtgtt 
ctgcactgct 
acagttataa 
ttaataacta 
aggaatattt 
gaggttttac 
aatgcaattg 
agcatcacaa 
aaactcatca 
tgtggaaagt 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 



cgccgccgcc 
acgaggaaga 
cctcggcggc 
ccgagcgccc 
cgctacaaat 
agcgcacagt 
aggaagagga 
tgccgatcgt 
accacgtgga 
tggcggccgt 
ccagcaacaa 
ttgcagaggt 
gcgctgagat 
acgtgattga 
gcaaggccaa 
acgcaaggtg 
gcggcatgtt 
ttatgcaggc 
ggtgcgagtg 
tgactccgtt 
gcgtgctggc 
atcgcaactc 
acctgctaaa 
cgcggatggt 
tgccagtggc 
caagggtggg 
aagtgtctct 
taatctgcgc 
gttcctggtt 
ccccggtaat 
aggcgtggag 
tttgggtgga 
gcaggagtga 
agggctcaaa 
cgcccccagt 
caaagcctgg 
tgacaatggc 
cttgcttctt 
gatgacgccg 
aagaagggtg 
tcgacctcga 
actacctaca 
taaactactg 
ggagcagtgg 
tagtgatgat 
ggtagaagac 
tagtaataga 
atacaagaaa 
tcataacata 
tgctcaaaaa 
gatgtatagt 
ttgctttaaa 
ttgttgttaa 
atttcacaaa 
atgtatctta 
ccccaggctc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctcggcctc 
gcaaaaagct 



tccccgggcg 
ctcatcacaa 
ggatttggcc 
gccatcacca 
ggtgggtttc 
gcggcggctg 
agagcccagc 
gtctgcgtgg 
taacgatcta 
atgcaagacc 
gacctttgtg 
gacctacaag 
cgaaggcgag 
aatggatgtg 
gatcgtgaag 
ctgcgtgcac 
cttctctgaa 
gctgtatcct 
caactcaaag 
cgccctgagc 
cagcgtgcac 
gcgcgcgcag 
cgcgttggtg 
tgtgcctgag 
gcatagcgat 
ggtaaataat 
agtacattat 
actgtggctg 
gcgacgcagg 
aaggttcatg 
aaacatgcag 
cttttccagc 
ctcgtactca 
gaggtagcat 
ctcgcgaccg 
aaagcgcttg 
tttcagttcc 
tatgttgtgg 
cggtgcggcc 
ggctcgtcca 
gggatctttg 
gagatttaaa 
attctaattg 
tggaatgcct 
gaggctactg 
cccaaggact 
actcttgctt 
attatggaaa 
ctgttttttc 
ttgtgtacct 
gccttgacta 
aaacctccca 
cttgtttatt 
taaagcattt 
tcatgtctgg 
cccagcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
tcacgctgcc 



cccccaaaaa 
gacgcgctgg 
attgcgccca 
gaggtaatcg 
agcaacccac 
aatgaagacg 
gaagcggaaa 
gagaagggca 
aaggcgaact 
tggctgaacg 
acgatgatgg 
catcacgagc 
cttaagtgtc 
acgagcgaaa 
aaccggtggg 
gacgcggcct 
ggcgcaaagg 
aacgcccaga 
cctgggcacg 
aacgcggagg 
cacccggcgc 
ggcggaggcc 
atggtgcgca 
tttaagtgga 
gcgcggcaga 
cacccgagag 
ttttacatgt 
cggaagtagg 
gtgggctgta 
gtggggttgt 
cagaatagtc 
gttatacagc 
aactggtaaa 
gtttttgagt 
gccgtattga 
tcataggtgc 
tgctcactgg 
cgttgccggc 
ggtgcacacg 
tgggatccat 
tgaaggaacc 
gctctaaggt 
tttgtgtatt 
ttaatgagga 
ctgactctca 
ttccttcaga 
gctttgctat 
aatattctgt 
ttactccaca 
ttagcttttt 
gagatcataa 
cacctccccc 
gcagcttata 
ttttcactgc 
atccggctgt 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
gcaagcactc 



agcggatgag 
tgccgcgcac 
agaagaaaaa 
tggacagcga 
cggtgctaat 
acccagtggc 
gtgaaattac 

tggaggctgc 

tcaaactact 
aggagcaccg 
ggcgattcct 
ccacgggctg 
tacacggaag 
acgggcagcg 
gccgaaatgt 
gtccggccaa 
ctcaggtggc 
ccgggcacgg 
cgcccttttt 
acctggacgc 
tgatagtgtt 
ccaactgcga 
gcctgtggag 
gcactaaaca 
acccctttga 
tgtacaaata 
ttttcaagtg 
gcgagtggcg 
cctggggact 
gatccatggg 
cacaggcggc 
ggtcggggga 
cctgcttgag 
gcgggttcca 
ctatggcgca 
ccaaaaaata 
agcccatggc 
cgagaagggc 
gaccacgtca 
atatagggcc 
ttacttctgt 
aaatataaaa 
ttagattcca 
aaacctgttt 
acattctact 
attgctaagt 
ttacaccaca 
aacctttata 
caggcataga 
aatttgtaaa 
tcagccatac 
tgaacctgaa 
atggttacaa 
attctagttg 
ggaatgtgtg 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
agggcgcaag 



gcggcgtatc 
acccagcccg 
gaagcgccct 
ggaagaaaga 
caagcatggc 
gcgtggtatg 
ggtgatgaac 
gcgcgcgctg 
gcctgaccaa 
cgggttgcag 
gcaggcgtac 
cgcgttgtgg 
cattatgata 
cgcgctgaag 
ggtgcagatc 
tcagttttcc 
ttttaagcag 
tcaccttttg 
gggaaggcag 
ggatctgatc 
ccagtgctgc 
cttcaagata 
tgaaaacttc 
ccagtatcgc 
tttttaaacg 
aaagcatttg 
acaaaaagaa 
ctccaggaag 
gttgagcatg 
agtttggggc 
cgagttgggc 
agaagcaatg 
tcgctggtca 
ggcaaaggcc 
ggcgagcttg 
tggcccacaa 
ggcagctgtt 
gtgcgcaggt 
aagacttcaa 
cgggttataa 
ggtgtgacat 
tttttaagtg 
acctatggaa 
tgctcagaag 
cctccaaaaa 
tttttgagtc 
aaggaaaaag 
agtaggcata 
gtgtctgcta 
ggggttaata 
cacatttgta 
acataaaatg 
ataaagcaat 
tggtttgtcc 
tcagttaggg 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
ggctgctaaa 



2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
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ggaagcggaa cacgtagaaa gccagtccgc agaaacggtg ctgaccccgg atgaatgtca 6060 
gctactgggc tatctggaca agggaaaacg caagcgcaaa gagaaagcag gtagcttgca 612 0 
gtgggcttac atggcgatag ctagactggg cggttttatg gacagcaagc gaaccggaat 618 0 
tgccagctgg ggcgccctct ggtaaggttg ggaagccctg caaagtaaac tggatggctt 624 0 
tcttgccgcc aaggatctga tggcgcaggg gatcaagatc tgatcaagag acaggatgag 63 0 0 
gatcgtttcg catgattgaa caagatggat tgcacgcagg ttctccggcc gcttgggtgg 6360 
agaggctatt cggctatgac tgggcacaac agacaatcgg ctgctctgat gccgccgtgt 642 0 
tccggctgtc agcgcagggg cgcccggttc tttttgtcaa gaccgacctg tccggtgccc 648 0 
tgaatgaact gcaggacgag gcagcgcggc tatcgtggct ggccacgacg ggcgttcctt 654 0 
gcgcagctgt gctcgacgtt gtcactgaag cgggaaggga ctggctgcta ttgggcgaag 660 0 
tgccggggca ggatctcctg tcatctcacc ttgctcctgc cgagaaagta tccatcatgg 6660 
ctgatgcaat gcggcggctg catacgcttg atccggctac ctgcccattc gaccaccaag 672 0 
cgaaacatcg catcgagcga gcacgtactc ggatggaagc cggtcttgtc gatcaggatg 67 8 0 
atctggacga agagcatcag gggctcgcgc cagccgaact gttcgccagg ctcaaggcgc 684 0 
gcatgcccga cggcgaggat ctcgtcgtga cccatggcga tgcctgcttg ccgaatatca 690 0 
tggtggaaaa tggccgcttt tctggattca tcgactgtgg ccggctgggt gtggcggacc 696 0 
gctatcagga catagcgttg gctacccgtg atattgctga agagcttggc ggcgaatggg 7020 
ctgaccgctt cctcgtgctt tacggtatcg ccgctcccga ttcgcagcgc atcgccttct 7080 
atcgccttct tgacgagttc ttctgagcgg gactctgggg ttcgaaatga ccgaccaagc 7140 
gacgcccaac ctgccatcac gagatttcga ttccaccgcc gccttctatg aaaggttggg 72 0 0 
cttcggaatc gttttccggg acgccggctg gatgatcctc cagcgcgggg atctcatgct 7260 
ggagttcttc gcccaccccg ggctcgatcc cctcgcgagt tggttcagct gctgcctgag 732 0 
gctggacgac ctcgcggagt tctaccggca gtgcaaatcc gtcggcatcc aggaaaccag 73 8 0 
cagcggctat ccgcgcatcc atgcccccga actgcaggag tggggaggca cgatggccgc 744 0 
tttggtcccg gatctttgtg aaggaacctt acttctgtgg tgtgacataa ttggacaaac 75 0 0 
tacctacaga gatttaaagc tctaaggtaa atataaaatt tttaagtgta taatgtgtta 7560 
aactactgat tctaattgtt tgtgtatttt agattccaac ctatggaact gatgaatggg 762 0 
agcagtggtg gaatgccttt aatgaggaaa acctgttttg ctcagaagaa atgccatcta 768 0 
gtgatgatga ggctactgct gactctcaac attctactcc tccaaaaaag aagagaaagg 7740 
tagaagaccc caaggacttt ccttcagaat tgctaagttt tttgagtcat gctgtgttta 7 80 0 
gtaatagaac tcttgcttgc tttgctattt acaccacaaa ggaaaaagct gcactgctat 7 86 0 
acaagaaaat tatggaaaaa tattctgtaa cctttataag taggcataac agttataatc 792 0 
ataacatact gttttttctt actccacaca ggcatagagt gtctgctatt aataactatg 7980 
ctcaaaaatt gtgtaccttt agctttttaa tttgtaaagg ggttaataag gaatatttga 8040 
tgtatagtgc cttgactaga gatcataatc agccatacca catttgtaga ggttttactt 810 0 
gctttaaaaa acctcccaca cctccccctg aacctgaaac ataaaatgaa tgcaattgtt 816 0 
gttgttaact tgtttattgc agcttataat ggttacaaat aaagcaatag catcacaaat 822 0 
ttcacaaata aagcattttt ttcactgcat tctagttgtg gtttgtccaa actcatcaat 8280 
gtatcttatc atgtctggat ccccaggaag ctcctctgtg tcctcataaa ccctaacctc 8340 
ctctacttga gaggacattc caatcatagg ctgcccatcc accctctgtg tcctcctgtt 840 0 
aattaggtca cttaacaaaa aggaaattgg gtaggggttt ttcacagacc gctttctaag 84 6 0 
ggtaatttta aaatatctgg gaagtccctt ccactgctgt gttccagaag tgttggtaaa 852 0 
cagcccacaa atgtcaacag cagaaacata caagctgtca gctttgcaca agggcccaac 8580 
accctgctca tcaagaagca ctgtggttgc tgtgttagta atgtgcaaaa caggaggcac 8 64 0 
attttcccca cctgtgtagg ttccaaaata tctagtgttt tcatttttac ttggatcagg 8700 
aacccagcac tccactggat aagcattatc cttatccaaa acagccttgt ggtcagtgtt 8760 
catctgctga ctgtcaactg tagcattttt tggggttaca gtttgagcag gatatttggt 8 82 0 
cctgtagttt gctaacacac cctgcagctc caaaggttcc ccaccaacag caaaaaaatg 8880 
aaaatttgac ccttgaatgg gttttccagc accattttca tgagtttttt gtgtccctga 8 94 0 
atgcaagttt aacatagcag ttaccccaat aacctcagtt ttaacagtaa cagcttccca 900 0 
catcaaaata tttccacagg ttaagtcctc atttaaatta ggcaaaggaa ttcttgaaga 906 0 
cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat aatggtttct 912 0 
tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 918 0 
taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 924 0 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 93 0 0 
gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 936 0 
gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 942 0 
cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa agttctgcta 9480 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 954 0 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 960 0 
atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
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ttacttctga 
gatcatgtaa 
gagcgtgaca 
gaactactta 
gcaggaccac 
gccggtgagc 
cgtatcgtag 
atcgctgaga 
tatatacttt 
ctttttgata 
gaccccgtag 
tgcttgcaaa 
ccaactcttt 
ctagtgtagc 
gctctgctaa 
ttggactcaa 
tgcacacagc 
ctatgagaaa 
agggtcggaa 
agtcctgtcg 
gggcggagcc 
tggccttttg 
accgcctttg 
gtgagcgagg 
atttcacacc 
cagtatctgc 
gctacaacaa 
tttgcgctgc 
tttaggcgaa 
tttcgctttt 
acatggtaac 
gattggtgga 
catggattgg 



caacgatcgg 
ctcgccttga 
ccacgatgcc 
ctctagcttc 
ttctgcgctc 
gtgggtctcg 
ttatctacac 
taggtgcctc 
agattgattt 
atctcatgac 
aaaagatcaa 
caaaaaaacc 
ttccgaaggt 
cgtagttagg 
tcctgttacc 
gacgatagtt 
ccagcttgga 
gcgccacgct 
caggagagcg 
ggtttcgcca 
tatggaaaaa 
ctcacatgtt 
agtgagctga 
aagcggaaga 
gcatatggtg 
tccctgcttg 
ggcaaggctt 
ttcgcgatgt 
aagcggggct 
gcatagggag 
gatgagttag 
agtaaggtgg 
acgaaccact 



aggaccgaag 
tcgttgggaa 
tgcagcaatg 
ccggcaacaa 
ggcccttccg 
cggtatcatt 
gacggggagt 
actgattaag 
aaaacttcat 
caaaatccct 
aggatcttct 
accgctacca 
aactggcttc 
ccaccacttc 
agtggctgct 
accggataag 
gcgaacgacc 
tcccgaaggg 
cacgagggag 
cctctgactt 
cgccagcaac 
ctttcctgcg 
taccgctcgc 
gcgcctgatg 
cactctcagt 
tgtgttggag 
gaccgacaat 
acgggccaga 
tcggttgtac 

ggggaaatgt 

caacatgcct 
tacgatcgtg 



gagctaaccg 
ccggagctga 
gcaacaacgt 
ttaatagact 
gctggctggt 
gcagcactgg 
caggcaacta 
cattggtaac 
ttttaattta 
taacgtgagt 
tgagatcctt 
gcggtggttt 
agcagagcgc 
aagaactctg 
gccagtggcg 
gcgcagcggt 
tacaccgaac 
agaaaggcgg 
cttccagggg 
gagcgtcgat 
gcggcctttt 
ttatcccctg 
cgcagccgaa 
cggtattttc 
acaatctgct 
gtcgctgagt 
tgcatgaaga 
tatacgcgta 
gcggttagga 
agtcttatgc 
tacaaggaga 
ccttattagg 



cttttttgca 
atgaagccat 
tgcgcaaact 
ggatggaggc 
ttattgctga 
ggccagatgg 
tggatgaacg 
tgtcagacca 
aaaggatcta 
tttcgttcca 
tttttctgcg 
gtttgccgga 
agataccaaa 
tagcaccgcc 
ataagtcgtg 
cgggctgaac 
tgagatacct 
acaggtatcc 
gaaacgcctg 
ttttgtgatg 
tacggttcct 
attctgtgga 
cgaccgagcg 
tccttacgca 
ctgatgccgc 
agtgcgcgag 
atctgcttag 
tctgagggga 
gtcccctcag 
aatacacttg 
gaaaaagcac 
aaggcaacag 



caacatgggg 
accaaacgac 
attaactggc 
ggataaagtt 
taaatctgga 
taagccctcc 
aaatagacag 
agtttactca 
ggtgaagatc 
ctgagcgtca 
cgtaatctgc 
tcaagagcta 
tactgtcctt 
tacatacctc 
tcttaccggg 

ggggggttcg 

acagcgtgag 
ggtaagcggc 
gtatctttat 
ctcgtcaggg 
ggccttttgc 
taaccgtatt 
cagcgagtca 
tctgtgcggt 
atagttaagc 
caaaatttaa 
ggttaggcgt 
ctagggtgtg 
gatatagtag 
tagtcttgca 
cgtgcatgcc 

acgggtctga 



9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11600 



<210> 49 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad5 pent on residues 337-344 



<400> 49 

His Ala lie Arg Gly Asp Thr Phe 
1 5 



<210> 50 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> PD1 penton mutation 
<400> 50 

Ser Arg Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser 
15 10 15 



<210> 51 
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<211> 35211 
<212> DNA 

<213> PArtificial Sequence 
<220> 

<2 23> Plasmid AvlnBg 



<400> 51 

catcatcaat 

ttgtgacgtg 

gatgttgcaa 

gtgtgcgccg 

taaatttggg 

agtgaaatct 

gactttgacc 

cgggtcaaag 

gcttggtacc 

atctgagggg 

agtcccctca 

caatactctt 

agaaaaagca 

gaaggcaaca 

tattgtattt 

tgtgcacctc 

ggctcgcggt 

tcggcctccg 

ggaaaacctc 

ccgggctgca 

gaccccaagg 

ttacaacgtc 

ccccctttcg 

ttgcgcagcc 

gaaagctggc 

cagatgcacg 

ccgccgtttg 

gaaagctggc 

catctgtggt 

tttgacctga 

tggagtgacg 

gacgtctcgt 

tttaatgatg 

cgtgactacc 

accgcgcctt 

ctacgtctga 

gcggtggttg 

ggtttccgcg 

attcgaggcg 

acgatggtgc 

cattatccga 

gatgaagcca 

cgctggctac 

ccgagtgtga 

ctgtatcgct 

gccgacacca 

cccttcccgg 

cgcccgctga 

aaatactggc 

gtggatcagt 

gattttggcg 

cgcacgccgc 

ttatccgggc 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
gtttacgtgg 
ttggcgtttt 
ggtgaattcg 
actagggtgt 
ggatatagta 
gtagtcttgc 
ccgtgcatgc 
gacgggtctg 
aagtgcctag 
cggccctggc 
tgaggacaaa 
aacggtactc 
tcgagaaagg 
ggaattcgat 
actttccttc 
gtgactggga 
ccagctggcg 
tgaatggcga 
tggagtgcga 
gttacgatgc 
ttcccacgga 
tacaggaagg 
gcaacgggcg 
gcgcattttt 
gcagttatct 
tgctgcataa 
atttcagccg 
tacgggtaac 
tcggcggtga 
acgtcgaaaa 
aactgcacac 
aggtgcggat 
ttaaccgtca 
aggatatcct 
accatccgct 
atattgaaac 
cggcgatgag 
tcatctggtc 
ggatcaaatc 
cggccaccga 
ctgtgccgaa 
tcctttgcga 
aggcgtttcg 
cgctgattaa 
atacgccgaa 
atccagcgct 
aaaccatcga 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 
gtgttactca 
agactcgccc 
attattatag 
ctagcgttcg 
gtttaggcga 
gtttcgcttt 
aacatggtaa 
cgattggtgg 
acatggattg 
ctcgatacaa 
cactctcttc 
ctcttcgcgg 
cgccgccgag 
cgtgtaacca 
ctagatggat 
agaattgcta 
aaaccctggc 
taatagcgaa 
atggcgcttt 
tcttcctgag 
gcccatctac 
gaatccgacg 
ccagacgcga 

ctgggtcggt 

acgcgccgga 
ggaagatcag 
accgactaca 
cgctgtactg 
agtttcttta 
aattatcgat 
cccgaaactg 
cgccgacggc 
tgaaaatggt 
cgagcatcat 
gctgatgaag 
gtggtacacg 
ccacggcatg 
cgaacgcgta 
gctggggaat 
tgtcgatcct 
tattatttgc 
atggtccatc 
atacgcccac 
tcagtatccc 
atatgatgaa 
cgatcgccag 
gacggaagca 
agtgaccagc 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
aggtgttttt 
tcagtacgta 
cgccccgatg 
aaagcggggc 
tgcataggga 
cgatgagtta 
aagtaaggtg 
gacgaaccac 
taaacgccat 
cgcatcgctg 
tctttccagt 
ggacctgagc 
gtcacagtcg 
aaaggtccaa 
agttttttga 
gttacccaac 
gaggcccgca 
gcctggtttc 
gccgatactg 
accaacgtaa 
ggttgttact 
attatttttg 
tacggccagg 
gaaaaccgcc 
gatatgtggc 
caaatcagcg 
gaggctgaag 
tggcagggtg 
gagcgtggtg 
tggagcgccg 
acgctgattg 
ctgctgctgc 
cctctgcatg 
cagaacaact 
ctgtgcgacc 
gtgccaatga 
acgcgaatgg 
gaatcaggcc 
tcccgcccgg 
ccgatgtacg 
aaaaaatggc 
gcgatgggta 
cgtttacagg 
aacggcaacc 
ttctgtatga 
aaacaccagc 
gaatacctgt 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 
tatttgtcta 
ctcaggtgtt 
ccagtgcact 
tacgggccag 
ttcggttgta 

gggggaaatg 

gcaacatgcc 
gtacgatcgt 
tgaattccgc 
ttgaccattc 
tctgcggggg 
actcttggat 
gagtccgcat 
ctctagaact 
aaaagaagag 
gtgattcact 
ttaatcgcct 
ccgatcgccc 
cggcaccaga 
tcgtcgtccc 
cctatcccat 
cgctcacatt 
atggcgttaa 
acagtcgttt 
tcgcggtgat 
ggatgagcgg 
atttccatgt 
ttcagatgtg 
aaacgcaggt 
gttatgccga 
aaatcccgaa 
aagcagaagc 
tgaacggcaa 
gtcaggtcat 
ttaacgccgt 
gctacggcct 
atcgtctgac 
tgcagcgcga 
acggcgctaa 
tgcagtatga 
cgcgcgtgga 
tttcgctacc 
acagtcttgg 
gcggcttcgt 
cgtggtcggc 
acggtctggt 
agcagttttt 
tccgtcatag 



ggggtggagt 

gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 
gggccgcggg 
ttccgcgttc 
ggcctaggaa 
atatacgcgt 
cgcggttagg 
tagtcttatg 
ttacaaggag 
gccttattag 
attgcagaga 
accacattgg 
ccagctgttg 
cggaaacccg 
cgaccggatc 
agtggatccc 
aaaggtagaa 
ggccgtcgtt 
tgcagcacat 
ttcccaacag 

a-gcggtgccg 

ctcaaactgg 
tacggtcaat 
taatgttgat 
ctcggcgttt 
gccgtctgaa 
ggtgctgcgt 
cattttccgt 
tgccactcgc 
cggcgagttg 
cgccagcggc 
tcgcgtcaca 
tctctatcgt 
ctgcgatgtc 
gccgttgctg 
ggatgagcag 
gcgctgttcg 
gtatgtggtg 
cgatgatccg 
tcgtaatcac 
tcacgacgcg 
aggcggcgga 
tgaagaccag 
tggagagacg 
cggtttcgct 
ctgggactgg 
ttacggcggt 
ctttgccgac 
ccagttccgt 
cgataacgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 
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ctcctgcact 
gatgtcgctc 
gccgggcaac 
gccgggcaca 
cccgccgcgt 
ctgggtaata 
ggcgataaaa 
aacgacattg 
aaggcggcgg 
gctgatgcgg 
atcagccgga 
gaagtggcga 
gtagcagagc 
actgccgcct 
ttcccgagcg 
tggcgcggcg 
agccatcgcc 
atggggattg 
gccggtcgct 
atcgataccg 
tcacaaattt 
tcatcaatgt 
atgagacccg 
ctgtgatgct 
gcgctgagtt 
gcttaagggt 
cagcagccgc 
tgacaacgcg 
atggtcgccc 
cgccgttgga 
ttgtgactga 
cccgcgatga 
atgtcgtttc 
cccctcccaa 
agcaagtgtc 
ggtctcggtc 
tgttcagata 
catgctgcgg 
taaaaatgtc 
caaagcggtt 
tttttaggtt 
ccagcacagt 
ggaagaactt 
tggcaatggg 
agttgtgttc 
agactgcggt 
ttcccacgct 
ggtttccggg 
accgcagccg 
gctgcagctg 
catgttttcc 
caaggaagca 
ttgaccaagc 
cagcatatct 
tcgtccagac 
tgggtcacgg 
gtcctgctgg 
accatggtgt 
gaggaggcgc 
aataccgatt 
acgagccagg 



ggatggtggc 
cacaaggtaa 
tctggctcac 
tcagcgcctg 
cccacgccat 
agcgttggca 
aacaactgct 
gcgtaagtga 
gccattacca 
tgctgattac 
aaacctaccg 
gcgatacacc 
gggtaaactg 
gttttgaccg 
aaaacggtct 
acttccagtt 
atctgctgca 
gtggcgacga 
accattacca 
tcgaaacttg 
cacaaataaa 
atcttatcat 
caccaggtgc 
ggatgtgacc 
tggctctagc 
gggaaagaat 
cgccgccatg 
catgccccca 
cgtcctgccc 
gactgcagcc 
ctttgctttc 
caagttgacg 
tcagcagctg 
tgcggtttaa 
ttgctgtctt 
gttgagggtc 
catgggcata 
ggtggtgttg 
tttcagtagc 
aagctgggat 
ggctatgttc 
gtatccggtg 
ggagacgccc 
cccacgggcg 
aggatgagat 
ataatggttc 
ttgagttcag 
gtaggggaga 
gtgggcccgt 
ccgtcatccc 
ctgaccaaat 
aagtttttca 
agttccaggc 
cctcgtttcg 
gggccagggt 
tgaaggggtg 
tgctgaagcg 
catagtccag 
cgcacgaggg 
ccggggagta 
tgagctctgg 



gctggatggt 
acagttgatt 
agtacgcgta 
gcagcagtgg 
cccgcatctg 
atttaaccgc 
gacgccgctg 
agcgacccgc 
ggccgaagca 
gaccgctcac 
gattgatggt 
gcatccggcg 
gctcggatta 
ctgggatctg 
gcgctgcggg 
caacatcagc 
cgcggaagaa 
ctcctggagc 
gttggtctgg 
tttattgcag 
gcattttttt 
gtctggatcc 
agaccctgcg 
gaggagctga 
gatgaagata 
atataaggtg 
agcaccaact 
tgggccgggg 
gcaaactcta 
tccgccgccg 
ctgagcccgc 
gctcttttgg 
ttggatctgc 
aacataaata 
tatttagggg 
ctgtgtattt 
agcccgtctc 
tagatgatcc 
aagctgattg 

gggtgcatac 

ccagccatat 
cacttgggaa 
ttgtgacctc 
gcggcctggg 
cgtcataggc 
catccggccc 
atggggggat 
tcagctggga 
aaatcacacc 
tgagcagggg 
ccgccagaag 
acggtttgag 
ggtcccacag 
cgggttgggg 
catgtctttc 
cgctccgggc 
ctgccggtct 
cccctccgcg 
gcagtgcaga 
ggcatccgcg 
ccgttcgggg 



aagccgctgg 
gaactgcctg 
gtgcaaccga 
cgtctggcgg 
accaccagcg 
cagtcaggct 
cgcgatcagt 
attgacccta 
gcgttgttgc 
gcgtggcagc 
agtggtcaaa 
cggattggcc 
gggccgcaag 
ccattgtcag 
acgcgcgaat 
cgctacagtc 
ggcacatggc 
ccgtcagtat 
tgtcaaaaat 
cttataatgg 
cactgcattc 
gacctcggat 
agtgtggcgg 
ggcccgatca 
cagattgagg 

ggggtcttat 

cgtttgatgg 
tgcgtcagaa 
ctaccttgac 
cttcagccgc 
ttgcaagcag 
cacaattgga 
gccagcaggt 
aaaaaccaga 
ttttgcgcgc 
tttccaggac 

tggggtggag 

agtcgtagca 
ccaggggcag 
gtggggatat 
ccctccgggg 
atttgtcatg 
caagattttc 
cgaagatatt 
catttttaca 
aggggcgtag 
catgtctacc 
agaaagcagg 
tattaccggg 
ggccacttcg 
gcgctcgccg 
accgtccgcc 
ctcggtcacc 
cggctttcgc 
cacgggcgca 
tgcgcgctgg 
tcgccctgcg 
gcgtggccct 
cttttgaggg 
ccgcaggccc 
tcaaaaacca 



caagcggtga 
aactaccgca 
acgcgaccgc 
aaaacctcag 
aaatggattt 
ttctttcaca 
tcacccgtgc 
acgcctgggt 
agtgcacggc 
atcaggggaa 

tggcgattac 

tgaactgcca 
aaaactatcc 
acatgtatac 
tgaattatgg 
aacagcaact 
tgaatatcga 
cggcggaatt 
aataatctcg 
ttacaaataa 
tagttgtggt 
ctggaaggtg 
taaacatatt 
cttggtgctg 
tactgaaatg 
gtagttttgt 
aagcattgtg 
tgtgatgggc 
ctacgagacc 
tgcagccacc 
tgcagcttcc 
ttctttgacc 
ttctgccctg 
ctctgtttgg 
gcggtaggcc 
gtggtaaagg 
gtagcaccac 
ggagcgctgg 
gcccttggtg 
gagatgcatc 
attcatgttg 
tagcttagaa 
catgcattcg 
tctgggatca 
aagcgcgggc 
ttaccctcac 
tgcggggcga 
ttcctgagca 
tgcaactggt 
ttaagcatgt 
cccagcgata 
gtaggcatgc 
tgctctacgg 
tgtacggcag 
gggtcctcgt 
ccagggtgcg 
cgtcggccag 
tggcgcgcag 
cgtagagctt 
cgcagacggt 
ggtttccccc 



agtgcctctg 
gccggagagc 
atggtcagaa 
tgtgacgctc 
ttgcatcgag 
gatgtggatt 
accgctggat 
cgaacgctgg 
agatacactt 
aaccttattt 
cgttgatgtt 
gctggcgcag 
cgaccgcctt 
cccgtacgtc 
cccacaccag 
gatggaaacc 
cggtttccat 
t cage t gage 
aatcaagctt 
agcaatagca 
ttgtccaaac 
ctgaggtacg 
aggaaccagc 
gcctgcaccc 
tgtgggcgtg 
atctgttttg 
agctcatatt 
tccagcattg 
gtgtctggaa 
gcccgcggga 
cgttcatccg 
egggaactta 
aaggcttcct 
atttggatca 
cgggaccagc 
tgactctgga 
tgeagagett 
gcgtggtgcc 
taagtgttta 
ttggactgta 
tgcagaacca 
ggaaatgcgt 
tccataatga 
etaaegtcat 
ggagggtgcc 
agatttgeat 
tgaagaaaac 
getgegaett 
agttaagaga 
ccctgactcg 
gcagttcttg 
ttttgagcgt 
catctcgatc 
tagtcggtgc 
cagegtagtc 
cttgaggctg 
gtagcatttg 
cttgcccttg 
gggegegaga 
ctcgcattcc 
atgctttttg 



3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
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6840 
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atgcgtttct tacctctggt ttccatgagc cggtgtccac gctcggtgac gaaaaggctg 69 0 0 
tccgtgtccc cgtatacaga cttgagaggc ctgtcctcga gcggtgttcc gcggtcctcc 6960 
tcgtatagaa actcggacca ctctgagaca aaggctcgcg tccaggccag cacgaaggag 702 0 
gctaagtggg aggggtagcg gtcgttgtcc actagggggt ccactcgctc cagggtgtga 7080 
agacacatgt cgccctcttc ggcatcaagg aaggtgattg gtttgtaggt gtaggccacg 7140 
tgaccgggtg ttcctgaagg ggggctataa aagggggtgg gggcgcgttc gtcctcactc 72 0 0 
tcttccgcat cgctgtctgc gagggccagc tgttggggtg agtactccct ctgaaaagcg 7260 
ggcatgactt ctgcgctaag attgtcagtt tccaaaaacg aggaggattt gatattcacc 7320 
tggcccgcgg tgatgccttt gagggtggcc gcatccatct ggtcagaaaa gacaatcttt 73 80 
ttgttgtcaa gcttggtggc aaacgacccg tagagggcgt tggacagcaa cttggcgatg 744 0 
gagcgcaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat gfcttagctgc 7500 
acgtattcgc gcgcaacgca ccgccattcg ggaaagacgg tggtgcgctc gtcgggcacc 7560 
aggtgcacgc gccaaccgcg gttgtgcagg gtgacaaggt caacgctggt ggctacctct 762 0 
ccgcgtaggc gctcgttggt ccagcagagg cggccgccct tgcgcgagca gaatggcggt 7680 
agggggtcta gctgcgtctc gtccgggggg tctgcgtcca cggtaaagac cccgggcagc 7740 
aggcgcgcgt cgaagtagtc tatcttgcat ccttgcaagt ctagcgcctg ctgccatgcg 7800 
cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac cccatggcat ggggtgggtg 78 60 
agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct gagtattcca 7920 
agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc gtatagttcg 7980 
tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc tgctcggaag 8 04 0 
actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg gaagacgttg 8100 
aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga gtcgcgcagc 8160 
ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag ggtttccttg 822 0 
atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag gacaaactct 82 80 
tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg gtaagagcct 8340 
agcatgtaga actggttgac ggcctggtag gcgcagcatc ccttttctac gggtagcgcg 8400 
tatgcctgcg cggccttccg gagcgaggtg tgggtgagcg caaaggtgtc cctgaccatg 8460 
actttgaggt actggtattt gaagtcagtg tcgtcgcatc cgccctgctc ccagagcaaa 852 0 
aagtccgtgc gctttttgga acgcggattt ggcagggcga aggtgacatc gttgaagagt 8580 
atctttcccg cgcgaggcat aaagttgcgt gtgatgcgga agggtcccgg cacctcggaa 8640 
cggttgttaa ttacctgggc ggcgagcacg atctcgtcaa agccgttgat gttgtggccc 8700 
acaatgtaaa gttccaagaa gcgcgggatg cccttgatgg aaggcaattt tttaagttcc 8760 
tcgtaggtga gctcttcagg ggagctgagc ccgtgctctg aaagggccca gtctgcaaga 8 82 0 
tgagggttgg aagcgacgaa tgagctccac aggtcacggg ccattagcat ttgcaggtgg 88 8 0 
tcgcgaaagg tcctaaactg gcgacctatg gccatttttt ctggggtgat gcagtagaag 8 94 0 
gtaagcgggt cttgttccca gcggtcccat ccaaggttcg cggctaggtc tcgcgcggca 9000 
gtcactagag gctcatctcc gccgaacttc atgaccagca tgaagggcac gagctgcttc 9060 
ccaaaggccc ccatccaagt ataggtctct acatcgtagg tgacaaagag acgctcggtg 912 0 
cgaggatgcg agccgatcgg gaagaactgg atctcccgcc accaattgga ggagtggcta 9180 
ttgatgtggt gaaagtagaa gtccctgcga cgggccgaac actcgtgctg gcttttgtaa 9240 
aaacgtgcgc agtactggca gcggtgcacg ggctgtacat cctgcacgag gttgacctga 93 0 0 
cgaccgcgca caaggaagca gagtgggaat ttgagcccct cgcctggcgg gtttggctgg 93 60 
tggtcttcta cttcggctgc ttgtccttga ccgtctggct gctcgagggg agttacggtg 942 0 
gatcggacca ccacgccgcg cgagcccaaa gtccagatgt ccgcgcgcgg cggtcggagc 94 8 0 
ttgatgacaa catcgcgcag atgggagctg tccatggtct ggagctcccg cggcgtcagg 954 0 
tcaggcggga gctcctgcag gtttacctcg catagacggg tcagggcgcg ggctagatcc 9600 
aggtgatacc taatttccag gggctggttg gtggcggcgt cgatggcttg caagaggccg 9660 
catccccgcg gcgcgactac ggtaccgcgc ggcgggcggt gggccgcggg ggtgtccttg 972 0 
gatgatgcat ctaaaagcgg tgacgcgggc gagcccccgg aggtaggggg ggctccggac 9780 
ccgccgggag agggggcagg ggcacgtcgg cgccgcgcgc gggcaggagc tggtgctgcg 984 0 
cgcgtaggtt gctggcgaac gcgacgacgc ggcggttgat ctcctgaatc tggcgcctct 990 0 
gcgtgaagac gacgggcccg gtgagcttga gcctgaaaga gagttcgaca gaatcaattt 996 0 
cggtgtcgtt gacggcggcc tggcgcaaaa tctcctgcac gtctcctgag ttgtcttgat 10020 
aggcgatctc ggccatgaac tgctcgatct cttcctcctg gagatctccg cgtccggctc 10080 
gctccacggt ggcggcgagg tcgttggaaa tgcgggccat gagctgcgag aaggcgttga 10140 
ggcctccctc gttccagacg cggctgtaga ccacgccccc ttcggcatcg cgggcgcgca 102 0 0 
tgaccacctg cgcgagattg agctccacgt gccgggcgaa gacggcgtag tttcgcaggc 102 60 
gctgaaagag gtagttgagg gtggtggcgg tgtgttctgc cacgaagaag tacataaccc 10320 
agcgtcgcaa cgtggattcg ttgatatccc ccaaggcctc aaggcgctcc atggcctcgt 103 80 
agaagtccac ggcgaagttg aaaaactggg agttgcgcgc cgacacggtt aactcctcct 10440 
ccagaagacg gatgagctcg gcgacagtgt cgcgcacctc gcgctcaaag gctacagggg 10500 
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cctcttcttc 
gcggtggggg 
gctcgatcat 

gggggcgcag 

catgcggcag 
cgccgaggga 
ctaaccagtc 
cggggttgtt 
ggcggatggt 
cggccatgcc 
gcctttctac 
ctgcggcggc 
cgaagcccct 
cctgctgcac 
cgcccgtgtt 
ccggctgcga 
cgttgcaagt 
agaggggcca 
gatatccgta 
gaaagtcgcg 
cgctctggcc 
tgtaagcggg 
accggggttc 
gtcgaaccca 
gcgcggcggc 
gctggaaagc 
gttgagtcgc 
ttgcctcccc 
ttttttgctt 
ggcaagagca 
gaggggcgac 
gggcccggca 
ctcctgagcg 
ggcagaacct 
tccacgcagg 
actttgagcc 
acctggtaac 
acaaccacgt 
gggactttgt 
tccttatagt 
tagagcccga 
aggagcgcag 
tgggcaagtt 
taaagatcga 
tgggcgttta 
tcagcgaccg 
atagagaggc 
gcgccctgga 
acgtcggcgg 
actaagcggt 
ggcgctgcag 
ccgcatcatg 
ccggctctcc 
ggtgctggcg 
cctggtctac 
caacctggac 
gcagcagggc 
caacgtgccg 
gactgagaca 
tagacaaggc 

gggggtgcgg 



ttcttcaatc 
aggggggaca 
ctccccgcgg 
ttggaagacg 
ggatacggcg 
cctgagcgag 
acagtcgcaa 
tctggcggag 
cgacagaagc 
ccaggcttcg 
cggcacttct 
ggcggagttt 
catcggctga 
ctgcgtgagg 
gatggtgtaa 
gagctcggtg 
ccgcaccagg 
gcgtagggtg 
gatgtacctg 
gacgcggttc 
ggtcaggcgc 
cactcttccg 
gagccccgta 
ggtgtgcgac 
tgctgcgcta 
gaaagcatta 
gggacccccg 
gtcatgcaag 
ttcccagatg 
agagcagcgg 
atccgcggtt 
ctacctggac 
gtacccaagg 
gtttcgcgac 
gcgcgagctg 
cgacgcgcga 
cgcatacgag 
gcgtacgctt 
aagcgcgctg 
gcagcacagc 
gggccgctgg 
cttgagcctg 
ttacgcccgc 
ggggttctac 
tcgcaacgag 
cgagctgatg 
cgagtcctac 
ggcagctggg 
cgtggaggaa 
gatgtttctg 
agccagccgt 
tcgctgactg 
gcaattctgg 
atcgtaaacg 
gacgcgctgc 
cggctggtgg 
aacctgggct 
cggggacagg 
ccgcaaagtg 
ctgcagaccg 
gctcccacag 



tcctcttcca 
cggcggcgac 
cgacggcgca 
ccgcccgtca 
ctaacgatgc 
tccgcatcga 
ggtaggctga 
gtgctgctga 
accatgtcct 
ttttgacatc 
tcttctcctt 
ggccgtaggt 
agcagggcta 
gtagactgga 
gtgcagttgg 
tacctgagac 
tactggtatc 
gccggggctc 
gacatccagg 
cagatgttgc 
gcgcaatcgt 
tggtctggtg 
tccggccgtc 
gtcagacaac 
gcttttttgg 
agtggctcgc 
gttcgagtct 
accccgcttg 
catccggtgc 
cagacatgca 
gacgcggcag 
ttggaggagg 
gtgcagctga 
cgcgagggag 
cggcatggcc 
accgggatta 
cagacggtga 
gtggcgcgcg 
gagcaaaacc 
agggacaacg 
ctgctcgatt 
gctgacaagg 
aagatatacc 
atgcgcatgg 
cgcatccaca 
cacagcctgc 
tttgacgcgg 
gccggacctg 
tatgacgagg 
atcagatgat 
ccggccttaa 
cgcgcaatcc 
aagcggtggt 
cgctggccga 
ttcagcgcgt 
gggatgtgcg 
ccatggttgc 
aggactacac 
aggtgtacca 
taaacctgag 
gcgaccgcgc 



taagggcctc 
gacggcgcac 
tggtctcggt 
tgtcccggtt 
atctcaacaa 
ccggatcgga 
gcaccgtggc 
tgatgtaatt 
tgggtccggc 
ggcgcaggtc 
cctcttgtcc 
ggcgccctct 
ggtcggcgac 
agtcatccat 
ccataacgga 
gcgagtaagc 
ccaccaaaaa 

cgggggcgag 
tgatgccggc 
gcagcggcaa 
tgacgctcta 
gataaattcg 
cgccgtgatc 

gggggagtgc 

ccactggccg 
tccctgtagc 
cggaccggcc 
caaattcctc 
tgcggcagat 
gggcaccctc 
cagatggtga 
gcgagggcct 
agcgtgatac 
aggagcccga 
tgaatcgcga 
gtcccgcgcg 
accaggagat 
aggaggtggc 
caaatagcaa 
aggcattcag 
tgataaacat 
tggccgccat 
atacccctta 
cgctgaaggt 
aggccgtgag 
aaagggccct 
gcgctgacct 
ggctggcggt 
acgatgagta 
gcaagacgca 
ctccacggac 
tgacgcgttc 
cccggcgcgc 
aaacagggcc 
ggctcgttac 
cgaggccgtg 
actaaacgcc 
caactttgtg 
gtctgggcca 
ccaggctttc 
gaccgtgtct 



cccttcttct 
cgggaggcgg 
gacggcgcgg 
atgggttggc 
ttgttgtgta 
aaacctctcg 

gggcggcagc 

aaagtaggcg 
ctgctgaatg 
tttgtagtag 
tgcatctctt 
tcctcccatg 
aacgcgctcg 
gtccacaaag 
ccagttaacg 
cctcgagtca 
gtgcggcggc 
atcttccaac 
ggcggtggtg 
aaagtgctcc 
gaccgtgcaa 
caagggtatc 
catgcggtta 
tccttttggc 
cgcgcagcgt 
cggagggtta 
ggactgcggc 
cggaaacagg 
gcgcccccct 
ccctcctcct 
ttacgaaccc 
ggcgcggcta 
gcgtgaggcg 
ggagatgcgg 
gcggttgctg 
cgcacacgtg 
taactttcaa 
tataggactg 
gccgctcatg 
ggatgcgctg 
cctgcagagc 
caactattcc 
cgttcccata 
gcttaccttg 
cgtgagccgg 
ggctggcacg 
gcgctgggcc 
ggcacccgcg 
cgagccagag 
acggacccgg 
gactggcgcc 
cggcagcagc 
gcaaacccca 
atccggcccg 
aacagcggca 
gcgcagcgtg 
ttcctgagta 
agcgcactgc 
gactattttt 
aaaaacttgc 
agcttgctga 



tcttctggcg 
tcgacaaagc 
ccgttctcgc 

ggggggctgc 

ggtactccgc 
agaaaggcgt 

gggcggcggt 

gtcttgagac 
cgcaggcggt 
tcttgcatga 
gcatctatcg 
cgtgtgaccc 
gctaatatgg 
cggtggtatg 
gtctggtgac 
aatacgtagt 
ggctggcggt 
ataaggcgat 
gaggcgcgcg 
atggtcggga 
aaggagagcc 
atggcggacg 
ccgcccgcgt 
ttccttccag 
aagcggttag 
ttttccaagg 
gaacgggggt 
gacgagcccc 
cctcagcagc 
accgcgtcag 
ccgcggcgcc 
ggagcgccct 
tacgtgccgc 
gatcgaaagt 
cgcgaggagg 
gcggccgccg 
aaaagcttta 
atgcatctgt 
gcgcagctgt 
ctaaacatag 
atagtggtgc 
atgcttagcc 
gacaaggagg 
agcgacgacc 
cggcgcgagc 
ggcagcggcg 
ccaagccgac 
cgcgctggca 
gacggcgagt 
cggtgcgggc 
aggtcatgga 
cgcaggccaa 
cgcacgagaa 
acgaggccgg 
acgtgcagac 
agcgcgcgca 
cacagcccgc 
ggctaatggt 
tccagaccag 
aggggctgtg 
cgcccaactc 
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gcgcctgttg ctgctgctaa tagcgccctt cacggacagt ggcagcgtgt cccgggacac 1422 0 
atacctaggt cacttgctga cactgtaccg cgaggccata ggtcaggcgc atgtggacga 14280 
gcatactttc caggagatta caagtgtcag ccgcgcgctg gggcaggagg acacgggcag 1434 0 
cctggaggca accctaaact acctgctgac caaccggcgg cagaagatcc cctcgttgca 144 0 0 
cagtttaaac agcgaggagg agcgcatttt gcgctacgtg cagcagagcg tgagccttaa 1446 0 
cctgatgcgc gacggggtaa cgcccagcgt ggcgctggac atgaccgcgc gcaacatgga 14520 
accgggcatg tatgcctcaa accggccgtt tatcaaccgc ctaatggact acttgcatcg 14580 
cgcggccgcc gtgaaccccg agtatttcac caatgccatc ttgaacccgc actggctacc 14640 
gccccctggt ttctacaccg ggggattcga ggtgcccgag ggtaacgatg gattcctctg 14700 
ggacgacata gacgacagcg tgttttcccc gcaaccgcag accctgctag agttgcaaca 14760 
gcgcgagcag gcagaggcgg cgctgcgaaa ggaaagcttc cgcaggccaa' gcagcttgtc 14 82 0 
cgatctaggc gctgcggccc cgcggtcaga tgctagtagc ccatttccaa gcttgatagg 1488 0 
gtctcttacc agcactcgca ccacccgccc gcgcctgctg ggcgaggagg agtacctaaa 1494 0 
caactcgctg ctgcagccgc agcgcgaaaa aaacctgcct ccggcatttc ccaacaacgg 15000 
gatagagagc ctagtggaca agatgagtag atggaagacg tacgcgcagg agcacaggga 15060 
cgtgccaggc ccgcgcccgc ccacccgtcg tcaaaggcac gaccgtcagc ggggtctggt 1512 0 
gtgggaggac gatgactcgg cagacgacag cagcgtcctg gatttgggag ggagtggcaa 1518 0 
cccgtttgcg caccttcgcc ccaggctggg gagaatgttt taaaaaaaaa aaagcatgat 1524 0 
gcaaaataaa aaactcacca aggccatggc accgagcgtt ggttttcttg tattcccctt 15300 
agtatgcggc gcgcggcgat gtatgaggaa ggtcctcctc cctcctacga gagtgtggtg 153 6 0 
agcgcggcgc cagtggcggc ggcgctgggt tctcccttcg atgctcccct ggacccgccg 1542 0 
tttgtgcctc cgcggtacct gcggcctacc ggggggagaa acagcatccg ttactctgag 15480 
ttggcacccc tattcgacac cacccgtgtg tacctggtgg acaacaagtc aacggatgtg 1554 0 
gcatccctga actaccagaa cgaccacagc aactttctga ccacggtcat tcaaaacaat 1560 0 
gactacagcc cgggggaggc aagcacacag accatcaatc ttgacgaccg gtcgcactgg 15660 
ggcggcgacc tgaaaaccat cctgcatacc aacatgccaa atgtgaacga gttcatgttt 1572 0 
accaataagt ttaaggcgcg ggtgatggtg tcgcgcttgc ctactaagga caatcaggtg 157 8 0 
gagctgaaat acgagtgggt ggagttcacg ctgcccgagg gcaactactc cgagaccatg 1584 0 
accatagacc ttatgaacaa cgcgatcgtg gagcactact tgaaagtggg cagacagaac 1590 0 
gggG'ttctgg aaagcgacat cggggtaaag tttgacaccc gcaacttcag actggggttt 1596 0 
gaccccgtca ctggtcttgt catgcctggg gtatatacaa acgaagcctt ccatccagac 16020 
atcattttgc tgccaggatg cggggtggac ttcacccaca gccgcctgag caacttgttg 160 8 0 
ggcatccgca agcggcaacc cttccaggag ggctttagga tcacctacga tgatctggag 16140 
ggtggtaaca ttcccgcact gttggatgtg gacgcctacc aggcgagctt gaaagatgac 162 0 0 
accgaacagg gcgggggtgg cgcaggcggc agcaacagca gtggcagcgg cgcggaagag 162 6 0 
aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaacga tcatgccatt 163 2 0 
cgcggcgaca cctttgccac acgggctgag gagaagcgcg ctgaggccga agcagcggcc 163 8 0 
gaagctgccg cccccgctgc gcaacccgag gtcgagaagc ctcagaagaa accggtgatc 1644 0 
aaacccctga cagaggacag caagaaacgc agttacaacc taataagcaa tgacagcacc 1650 0 
ttcacccagt accgcagctg gtaccttgca tacaactacg gcgaccctca gaccggaatc 16560 
cgctcatgga ccctgctttg cactcctgac gtaacctgcg gctcggagca ggtctactgg 16 62 0 
tcgttgccag acatgatgca agaccccgtg accttccgct ccacgcgcca gatcagcaac 16680 
tttccggtgg tgggcgccga gctgttgccc gtgcactcca agagcttcta caacgaccag 1674 0 
gccgtctact cccaactcat ccgccagttt acctctctga cccacgtgtt caatcgcttt 16800 
cccgagaacc agattttggc gcgcccgcca gcccccacca tcaccaccgt cagtgaaaac 16 86 0 
gttcctgctc tcacagatca cgggacgcta ccgctgcgca acagcatcgg aggagtccag 16 92 0 
cgagtgacca ttactgacgc cagacgccgc acctgcccct acgtttacaa ggccctgggc 1698 0 
atagtctcgc cgcgcgtcct atcgagccgc actttttgag caagcatgtc catccttata 17 04 0 
tcgcccagca ataacacagg ctggggcctg cgcttcccaa gcaagatgtt tggcggggcc 1710 0 
aagaagcgct ccgaccaaca cccagtgcgc gtgcgcgggc actaccgcgc gccctggggc 17160 
gcgcacaaac gcggccgcac tgggcgcacc accgtcgatg acgccatcga cgcggtggtg 1722 0 
gaggaggcgc gcaactacac gcccacgccg ccaccagtgt ccacagtgga cgcggccatt 172 8 0 
cagaccgtgg tgcgcggagc ccggcgctat gctaaaatga agagacggcg gaggcgcgta 173 4 0 
gcacgtcgcc accgccgccg acccggcact gccgcccaac gcgcggcggc ggccctgctt 17400 
aaccgcgcac gtcgcaccgg ccgacgggcg gccatgcggg ccgctcgaag gctggccgcg 17460 
ggtattgtca ctgtgccccc caggtccagg cgacgagcgg ccgccgcagc agccgcggcc 1752 0 
attagtgcta tgactcaggg tcgcaggggc aacgtgtatt gggtgcgcga ctcggttagc 1758 0 
ggcctgcgcg tgcccgtgcg cacccgcccc ccgcgcaact agattgcaag aaaaaactac 17640 
ttagactcgt actgttgtat gtatccagcg gcggcggcgc gcaacgaagc tatgtccaag 17700 
cgcaaaatca aagaagagat gctccaggtc atcgcgccgg agatctatgg ccccccgaag 17760 
aaggaagagc aggattacaa gccccgaaag ctaaagcggg tcaaaaagaa aaagaaagat 17820 
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gatgatgatg 
gtacagtgga 
acgcccggtg 
gaggacctgc 
aaggacatgc 
acactgcagc 
gagtctggtg 
gatgtcttgg 
atcaagcagg 
agtagcacca 
tcagcggtgg 
gaggtgcaaa 
aggaagtacg 
cctacccccg 
cgaaccacca 
tccgtgcgca 
caccccagca 
cgcctccgtt 
ggccacggcc 
cgtcgcatgc 
gccgtgcccg 
tgcatgtgga 
attttgtaga 
gttcatggga 
gggctcgctg 
ggcctggaac 
acaaaaggtg 
ggcagtgcaa 
accggccgtg 
ggaagaaact 
aggcctgccc 
acccgtaacg 
cccgaccgcc 
tccgcgatcg 
gggtctgggg 
tgtgtgtcat 
ctttccaaga 
caggacgcct 
tacttcagcc 
acagaccggt 
tactcgtaca 
tccacgtact 
ggcactgcct 
gctgctactg 
gacgagcaag 
aatattacaa 
aaaacatttc 
catgcagctg 
gcaaaaccca 
ctagaaagtc 
gataacttga 
actcatattt 
caatctatgc 
tattacaaca 
gttgtagatt 
ggtgatagaa 
gttagaatta 
ggaggtgtga 
ggatgggaaa 
tttgccatgg 
gcgctgtatt 



aacttgacga 
aaggtcgacg 
agcgctccac 
ttgagcaggc 
tggcgttgcc 
aggtgctgcc 
acttggcacc 
aaaaaatgac 
tggcgccggg 
gtattgccac 
cggatgccgc 
cggacccgtg 
gcgccgccag 
gctatcgtgg 
ctggaacccg 
gggtggctcg 
tcgtttaaaa 
tcccggtgcc 
tgacgggcgg 
gcggcggtat 
gaattgcatc 
aaaatcaaaa 
atggaagaca 
aactggcaag 
tggagcggca 
agcagcacag 
gtagatggcc 
aataagatta 
gagacagtgt 
ctggtgacgc 
accacccgtc 
ctggacctgc 
gttgttgtaa 
ttgcggcccg 
gtgcaatccc 
gtatgcgtcc 
tggctacccc 
cggagtacct 
tgaataacaa 
cccagcgttt 

aggcgcggtt 

ttgacatccg 
acaacgccct 
ctcttgaaat 
ctgagcagca 
aggagggtat 
aacctgaacc 
ggagagtcct 
caaatgaaaa 
aagtggaaat 
ctcctaaagt 
cttacatgcc 
ccaacaggcc 
gcacgggtaa 
tgcaagacag 
ccaggtactt 
ttgaaaatca 
ttaatacaga 
aagatgctac 
aaatcaatct 
tgcccgacaa 



cgaggtggaa 
cgtaaaacgt 
ccgcacctac 
caacgagcgc 
gctggacgag 
cgcgcttgca 
caccgtgcag 
cgtggaacct 
actgggcgtg 
cgccacagag 
ggtgcaggcg 
gatgtttcgc 
cgcgctactg 
ctacacctac 
ccgccgccgt 
cgaaggaggc 
gccggtcttt 
gggattccga 
catgcgtcgt 
cctgcccctc 
cgtggccttg 
taaaaagtct 
tcaactttgc 
atatcggcac 
ttaaaaattt 
gccagatgct 
tggcctctgg 
acagtaagct 
ctccagaggg 
aaatagacga 
ccatcgcgcc 
ctccccccgc 
cccgtcctag 
tagccagtgg 
tgaagcgccg 
atgtcgccgc 
ttcgatgatg 
gagccccggg 
gtttagaaac 
gacgctgcgg 
caccctagct 
cggcgtgctg 
ggctcccaag 
aaacctagaa 
aaaaactcac 
tcaaataggt 
tcaaatagga 
taaaaagact 
tggagggcaa 
gcaatttttc 
ggtattgtac 
cactattaag 
taattacatt 
tatgggtgtt 
aaacacagag 
ttctatgtgg 
tggaactgaa 
gactcttacc 
agaattttca 
aaatgccaac 
gctaaagtac 



ctgctgcacg 
gttttgcgac 
aagcgcgtgt 
ctcggggagt 
ggcaacccaa 
ccgtccgaag 
ctgatggtac 
gggctggagc 
cagaccgtgg 
ggcatggaga 
gtcgctgcgg 
gtttcagccc 
cccgaatatg 
cgccccagaa 
cgccgtcgcc 
aggaccctgg 
gtggttcttg 
ggaagaatgc 
gcgcaccacc 
cttattccac 
caggcgcaga 
ggactctcac 
gtctctggcc 
cagcaatatg 
cggttccacc 
gagggataag 
cattagcggg 
tgatccccgc 
gcgtggcgaa 
gcctccctcg 
catggctacc 
cgacacccag 
ccgcgcgtcc 
caactggcaa 
acgatgcttc 
cagaggagct 
ccgcagtggt 
ctggtgcagt 
cccacggtgg 
ttcatccctg 
gtgggtgata 
gacaggggcc 
ggtgccccaa 
gaagaggacg 
gtatttgggc 
gtcgaaggtc 
gaatctcagt 
accccaatga 
ggcattcttg 
tcaactactg 
agtgaagatg 
gaaggtaact 
gcttttaggg 
ctggcgggcc 
ctttcatacc 
aatcaggctg 
gatgaacttc 
aaggtaaaac 
gataaaaatg 
ctgtggagaa 
agtccttcca 



ctaccgcgcc 
ccggcaccac 
atgatgaggt 
ttgcctacgg 
cacctagcct 
aaaagcgcgg 
ccaagcgcca 
ccgaggtccg 
acgttcagat 
cacaaacgtc 
ccgcgtccaa 
cccggcgccc 
ccctacatcc 
gacgagcaac 
agcccgtgct 
tgctgccaac 
cagatatggc 
accgtaggag 
ggcggcggcg 
tgatcgccgc 
gacactgatt 
gctcgcttgg 
ccgcgacacg 
agcggtggcg 
gttaagaact 
ttgaaagagc 
gtggtggacc 
cctcccgtag 
aagcgtccgc 
tacgaggagg 
ggagtgctgg 
cagaaacctg 
ctgcgccgcg 
agcacactga 
tgaatagcta 
gctgagccgc 
cttacatgca 
ttgcccgcgc 
cgcctacgca 
tggaccgtga 
accgtgtgct 
ctacttttaa 
atccttgcga 
atgacaacga 
aggcgcctta 
aaacacctaa 
ggtacgaaac 
aaccatgtta 
taaagcaaca 
aggcgaccgc 
tagatataga 
cacgagaact 
acaattttat 
aagcatcgca 
agcttttgct 
ttgacagcta 
caaattactg 
ctaaaacagg 
aaataagagt 
atttcctgta 
acgtaaaaat 



caggcgacgg 
cgtagtcttt 
gtacggcgac 
aaagcggcat 
aaagcccgta 
cctaaagcgc 
gcgactggaa 
cgtgcggcca 
acccactacc 
cccggttgcc 
gacctctacg 
gcgcggttcg 
ttccattgcg 
tacccgacgc 
ggccccgatt 
agcgcgctac 
cctcacctgc 
gggcatggcc 
cgcgtcgcac 
ggcgattggc 
aaaaacaagt 
tcctgtaact 
gctcgcgccc 
ccttcagctg 
atggcagcaa 
aaaatttcca 
tggccaacca 
aggagcctcc 
gccccgacag 
cactaaagca 
gccagcacac 
tgctgccagg 
ccgccagcgg 
acagcatcgt 
acgtgtcgta 
cgcgcgcccg 
catctcgggc 
caccgagacg 
cgacgtgacc 
ggatactgcg 
ggacatggct 
gccctactct 
atgggatgaa 
agacgaagta 
ttctggtata 
atatgccgat 
tgaaattaat 
cggttcatat 
aaatggaaag 
aggcaatggt 
aaccccagac 
aatgggccaa 
tggtctaatg 
gttgaatgct 
tgattccatt 
tgatccagat 
ctttccactg 
tcaggaaaat 
tggaaataat 
ctccaacata 
ttctgataac 



17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
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21180 
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21480 



WO 2004/099422 



PCT/US2004/009219 



-85- 



ccaaacacct acgactacat gaacaagcga gtggtggctc ccgggttagt ggactgctac 2154 0 
attaaccttg gagcacgctg gtcccttgac tatatggaca acgtcaaccc atttaaccac 21600 
caccgcaatg ctggcctgcg ctaccgctca atgttgctgg gcaatggtcg ctatgtgccc 21660 
ttccacatcc aggtgcctca gaagttcttt gccattaaaa acctccttct cctgccgggc 2172 0 
tcatacacct acgagtggaa cttcaggaag gatgttaaca tggttctgca gagctcccta 21780 
ggaaatgacc taagggttga cggagccagc attaagtttg atagcatttg cctttacgcc 21840 
accttcttcc ccatggccca caacaccgcc tccacgcttg aggccatgct tagaaacgac 2190 0 
accaacgacc agtcctttaa cgactatctc tccgccgcca acatgctcta ccctataccc 21960 
gccaacgcta ccaacgtgcc catatccatc ccctcccgca actgggcggc tttccgcggc 22 02 0 
tgggccttca cgcgccttaa gactaaggaa accccatcac tgggctcggg ctacgaccct 22 0 80 
tattacacct actctggctc tataccctac ctagatggaa ccttttacct caaccacacc 22140 
tttaagaagg tggccattac ctttgactct tctgtcagct ggcctggcaa tgaccgcctg 222 0 0 
cttaccccca acgagtttga aattaagcgc tcagttgacg gggagggtta caacgttgcc 22260 
cagtgtaaca tgaccaaaga ctggttcctg gtacaaatgc tagctaacta caacattggc 22 32 0 
taccagggct tctatatccc agagagctac aaggaccgca tgtactcctt ctttagaaac 22380 
ttccagccca tgagccgtca ggtggtggat gatactaaat acaaggacta ccaacaggtg 22440 
ggcatcctac accaacacaa caactctgga tttgttggct accttgcccc caccatgcgc 22500 
gaaggacagg cctaccctgc taacttcccc tatccgctta taggcaagac cgcagttgac 22560 
agcattaccc agaaaaagtt tctttgcgat cgcacccttt ggcgcatccc attctccagt 22 62 0 
aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc 22680 
gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat 22740 
gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa 22 80 0 
accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa 22 86 0 
catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag 2 2 920 
atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt 22980 
ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 2 3 040 
actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg 23100 
gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc 23160 
gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg 23 22 0 
tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg 232 80 
ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac 23340 
ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc 23 40 0 
tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga 23460 
gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca 23520 
ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg 23580 
tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg 23 64 0 
acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc atccgcggca 23 70 0 
gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg 23760 
gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca 23 82 0 
cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct 23880 
tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 23 94 0 
ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta 240 00 
gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag 24060 
ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag 2412 0 
acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg 24180 
tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag 2424 0 
actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct 243 00 
ccttatttat cataatgctt ccgtgfcagac acttaagctc gccttcgatc tcagcgcagc 24360 
ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg 2442 0 
actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga 24480 
aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag 24540 
cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact 24600 
tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac 24660 
tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt 24720 
gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac 24780 
ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca 24840 
catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct 24900 
tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg 24960 
atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct 25020 
cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg 250 8 0 
gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct 25140 
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cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga 2 5200 
aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca 25260 
ccaccgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc 25320 
ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg 253 80 
accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg 25440 
aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc 25500 
tgttgaagca tctgcagcgc cagtgcgcca ttatctgcga cgcgttgcaa gagcgcagcg 25560 
atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc 25620 
gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct 25 68 0 
accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca 25740 
agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc 25 800 
agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc 25 86 0 
ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa 25 92 0 
gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac 25980 
gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga 2 6 040 
gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt 2610 0 
tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc 26160 
ttcaaacgcg cgagcctgcc gacttggagg agcgacgcaa actaatgatg gccgcagtgc 2622 0 
tcgttaccgt ggagcttgag tgcatgcagc ggttctttgc tgacccggag atgcagcgca 26280 
agctagagga aacattgcac tacacctttc gacagggcta cgtacgccag gcctgcaaga 26340 
tctccaacgt ggagctctgc aacctggtct cctaccttgg aattttgcac gaaaaccgcc 26400 
ttgggcaaaa cgtgcttcat tccacgctca agggcgaggc gcgccgcgac tacgtccgcg 26460 
actgcgttta cttatttcta tgctacacct ggcagacggc catgggcgtt tggcagcagt 2 652 0 
gcttggagga gtgcaacctc aaggagctgc agaaactgct aaagcaaaac ttgaaggacc 26580 
tatggacggc cttcaacgag cgctccgtgg ccgcgcacct ggcggacatc attttccccg 26640 
aacgcctgct taaaaccctg caacagggtc tgccagactt caccagtcaa agcatgttgc 26700 
agaactttag gaactttatc ctagagcgct caggaatctt gcccgccacc tgctgtgcac 26760 
ttcctagcga ctttgtgccc attaagtacc gcgaatgccc tccgccgctt tggggccact 26820 
gctaccttct gcagctagcc aactaccttg cctaccactc tgacataatg gaagacgtga 2 68 80 
gcggtgacgg tctactggag tgtcactgtc gctgcaacct atgcaccccg caccgctccc 26940 
tggtttgcaa ttcgcagctg cttaacgaaa gtcaaattat cggtaccttt gagctgcagg 2 7000 
gtccctcgcc tgacgaaaag tccgcggctc cggggttgaa actcactccg gggctgtgga 2 70 60 
cgtcggctta ccttcgcaaa tttgtacctg aggactacca cgcccacgag attaggttct 27120 
acgaagacca atcccgcccg ccaaatgcgg agcttaccgc ctgcgtcatt acccagggcc 27180 
acattcttgg ccaattgcaa gccatcaaca aagcccgcca agagtttctg ctacgaaagg 27240 
gacggggggt ttacttggac ccccagtccg gcgaggagct caacccaatc cccccgccgc 2 7300 
cgcagcccta tcagcagcag ccgcgggccc ttgcttccca ggatggcacc caaaaagaag 27360 
ctgcagctgc cgccgccacc cacggacgag gaggaatact gggacagtca ggcagaggag 2 742 0 
gttttggacg aggaggagga ggacatgatg gaagactggg agagcctaga cgaggaagct 2 7480 
tccgaggtcg aagaggtgtc agacgaaaca ccgtcaccct cggtcgcatt cccctcgccg 27540 
gcgccccaga aatcggcaac cggttccagc atggctacaa cctccgctcc tcaggcgccg 27600 
ccggcactgc ccgttcgccg acccaaccgt agatgggaca ccactggaac cagggccggt 27660 
aagtccaagc agccgccgcc gttagcccaa gagcaacaac agcgccaagg ctaccgctca 2 7720 
tggcgcgggc acaagaacgc catagttgct tgcttgcaag actgtggggg caacatctcc 27780 
ttcgcccgcc gctttcttct ctaccatcac ggcgtggcct tcccccgtaa catcctgcat 27840 
tactaccgtc atctctacag cccatactgc accggcggca gcggcagcgg cagcaacagc 27900 
agcggccaca cagaagcaaa ggcgaccgga tagcaagact ctgacaaagc ccaagaaatc 27960 
cacagcggcg gcagcagcag gaggaggagc gctgcgtctg gcgcccaacg aacccgtatc 2 8 020 
gacccgcgag cttagaaaca ggatttttcc cactctgtat gctatatttc aacagagcag 28080 
gggccaagaa caagagctga aaataaaaaa caggtctctg cgatccctca cccgcagctg 28140 
cctgtatcac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg aggctctctt 2 8200 
cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc aaatttaagc 2 82 60 
gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgtcg tcagcgccat 2 832 0 
tatgagcaag gaaattccca cgccctacat gtggagttac cagccacaaa tgggacttgc 28380 
ggctggagct gcccaagact actcaacccg aataaactac atgagcgcgg gaccccacat 2 844 0 
gatatcccgg gtcaacggaa tccgcgccca ccgaaaccga attctcttgg aacaggcggc 2 8500 
tattaccacc acacctcgta ataaccttaa tccccgtagt tggcccgctg ccctggtgta 28560 
ccaggaaagt cccgctccca ccactgtggt acttcccaga gacgcccagg ccgaagttca 2 8 620 
gatgactaac tcaggggcgc agcttgcggg cggctttcgt cacagggtgc ggtcgcccgg 28680 
gcagggtata actcacctga caatcagagg gcgaggtatt cagctcaacg acgagtcggt 2 8740 
gagctcctcg cttggtctcc gtccggacgg gacatttcag atcggcggcg ccggccgtcc 28800 



WO 2004/099422 



PCT/US2004/009219 



-87- 



ttcattcacg cctcgtcagg caatcctaac tctgcagacc tcgtcctctg agccgcgctc 28860 
tggaggcatt ggaactctgc aatttattga ggagtttgtg ccatcggtct actttaaccc 2 8 920 
cttctcggga cctcccggcc actatccgga tcaatttatt cctaactttg acgcggtaaa 28980 
ggactcggcg gacggctacg actgaatgtt aagtggagag gcagagcaac tgcgcctgaa 29 04 0 
acacctggtc cactgtcgcc gccacaagtg ctttgcccgc gactccggtg agttttgcta 29100 
ctttgaattg cccgaggatc atatcgaggg cccggcgcac ggcgtccggc ttaccgccca 29160 
gggagagctt gcccgtagcc tgattcggga gtttacccag cgccccctgc tagttgagcg 2 922 0 
ggacagggga ccctgtgttc tcactgtgat ttgcaactgt cctaaccttg gattacatca 29280 
agatctttgt tgccatctct gtgctgagta taataaatac agaaattaaa atatactggg 2 934 0 
gctcctatcg ccatcctgta aacgccaccg tcttcacccg cccaagcaaa ccaaggcgaa 29400 
ccttacctgg tacttttaac atctctccct ctgtgattta caacagtttc aacccagacg 29460 
gagtgagtct acgagagaac ctctccgagc tcagctactc catcagaaaa aacaccaccc 29520 
tccttacctg ccgggaacgt acgagtgcgt caccggccgc tgcaccacac ctaccgcctg 29580 
accgtaaacc agactttttc cggacagacc tcaataactc tgtttaccag aacaggaggt 29640 
gagcttagaa aacccttagg gtattaggcc aaaggcgcag ctactgtggg gtttatgaac 29700 
aattcaagca actctacggg ctattctaat tcaggtttct ctagaaatgg acggaattat 29760 
tacagagcag cgcctgctag aaagacgcag ggcagcggcc gagcaacagc gcatgaatca 2 9 820 
agagctccaa gacatggtta acttgcacca gtgcaaaagg ggtatctttt gtctggtaaa 29880 
gcaggccaaa gtcacctacg acagtaatac caccggacac cgccttagct acaagttgcc 29940 
aaccaagcgt cagaaattgg tggtcatggt gggagaaaag cccattacca taactcagca 3 0000 
ctcggtagaa accgaaggct gcattcactc accttgtcaa ggacctgagg atctctgcac 3 0 060 
ccttattaag accctgtgcg gtctcaaaga tcttattccc tttaactaat aaaaaaaaat 3 012 0 
aataaagcat cacttactta aaatcagtta gcaaatttct gtccagttta ttcagcagca 30180 
cctccttgcc ctcctcccag ctctggtatt gcagcttcct cctggctgca aactttctcc 30240 
acaatctaaa tggaatgtca gtttcctcct gttcctgtcc atccgcaccc actatcttca 30300 
tgttgttgca gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 3 03 60 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 3 042 0 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 30480 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 3 054 0 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 30600 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 3 0 660 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 3 0 720 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 3 07 80 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 3 0 840 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 3 0 900 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 30960 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 3102 0 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 31080 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 31140 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 312 0 0 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 31260 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 3132 0 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 31380 
tcaaaacaaa aattggccat ggcctagaat ttgafctcaaa caaggctatg gttcctaaac 31440 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 31500 
ataagctaac tttgtggacc acaccagctc catctcctaa ctgtagacta aatgcagaga 31560 
aagatgctaa actcactttg gtcttaacaa aatgtggcag tcaaatactt gctacagttt 31620 
cagttttggc tgttaaaggc agtttggctc caatatctgg aacagttcaa agtgctcatc 31680 
ttattataag atttgacgaa aatggagtgc tactaaacaa ttccttcctg gacccagaat 31740 
attggaactt tagaaatgga gatcttactg aaggcacagc ctatacaaac gctgttggat 3180 0 
ttatgcctaa cctatcagct tatccaaaat ctcacggtaa aactgccaaa agtaacattg 31860 
tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgt aacactaacc attacactaa 3192 0 
acggtacaca ggaaacagga gacacaactc caagtgcata ctctatgtca ttttcatggg 31980 
actggtctgg ccacaactac attaatgaaa tatttgccac atcctcttac actttttcat 32040 
acattgccca agaataaaga atcgtttgtg ttatgtttca acgtgtttat ttttcaattg 32100 
cagaaaattt caagtcattt ttcattcagt agtatagccc caccaccaca tagcttatac 32160 
agatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc acctccctcc 32220 
caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat catatcatgg 32280 
gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc caaacgctca 32340 
tcagtgatat taataaactc cccgggcagc tcacttaagt tcatgtcgct gtccagctgc 3240 0 
tgagccacag gctgctgtcc aacttgcggt tgcttaacgg gcggcgaagg agaagtccac 32460 
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gcctacatgg 
gcgcgaataa 
tcctcagcga 
cgcaccctga 
aaaatcccac 
tggccatcat 
ataaacatta 
tgattaaaca 
gctatacact 
ccatggatca 
cacttcctca 
tcctgaatca 
attgtcaaag 
tctgtctcaa 
cgtgttggtc 
aaaccaggtg 
tgtgtagtag 
ttctatgtaa 
cacacccagc 
tggaagaacc 
ctattaagtg 
taatggcatt 
agtggacgta 
caaccatgcc 
gaatattaag 
agcagcgaat 
cggaacatta 
atcgtgcagg 
acccacactg 
agctttgttg 
cctcgcgcaa 
ccggaaccac 
taaacacaaa 
aaaaacaacc 
ctggtcaccg 
gtaagactcg 
agcccggggg 
taacaaaatt 
aaatagcacc 
tcagccttac 
tcaatcagtc 
atgacgtaac 
agaaacgaaa 
acgtcacttc 
cctaaaacct 
cctcattatc 



gggtagagtc 
actgctgccg 
tgattcgcac 
tctcacttaa 
agtgcaaggc 
accacaagcg 
cctcttttgg 
tggcgccatc 
gcagggaacc 
tcatgctcgt 
ggattacaag 
gcgtaaatcc 
tgttacattc 
aaggaggtag 
gtagtgtcat 
cgggcgtgac 
ttgtagtata 
actccttcat 
caacctacac 
atgttttttt 
aacgcgctcc 
tgtaagatgt 
aaggctaaac 
caaataattc 
tccggccatt 
catgattgca 
acaaaaatac 
tctgcacgga 
attatgacac 
catgggcggc 
aaaagaaagc 
cacagaaaaa 
ataaaataac 
cttataagca 
tgattaaaaa 
gtaaacacat 
aatacatacc 
aataggagag 
ctcccgctcc 
cagtaaaaaa 
acagtgtaaa 
ggttaaagtc 
gccaaaaaac 
ccattttaat 
acgtcacccg 
atattggctt 



ataatcgtgc 
ccgccgctcc 
cgcccgcagc 
atcagcacag 
gctgtatcca 
caggtagatt 
catgttgtaa 
cacca.cca.tc 
gggactggaa 
catgatatca 
ctcctcccgc 
cacactgcag 
gggcagcagc 
acgatcccta 
gccaaatgga 
aaacagatct 
tccactctct 
gcgccgctgc 
attcgttctg 
ttttattcca 
cctccggtgg 
tgcacaatgg 
ccttcagggt 
tcatctcgcc 
gtaaaaatct 
aaaattcagg 
cgcgatcccg 
ccagcgcggc 
gcatactcgg 
gatataaaat 
acatcgtagt 
gacaccattt 
aaaaaaacat 
taagacggac 
gcaccaccga 
caggttgatt 
cgcaggcgta 
aaaaacacat 
agaacaacat 
gaaaacctat 
aaagggccaa 
cacaaaaaac 
ccacaacttc 
taagaaaact 
ccccgttccc 
caatccaaaa 



atcaggatag 
gtcctgcagg 
ataaggcgcc 
taactgcagc 
aagctcatgg 
aagtggcgac 
ttcaccacct 
ctaaaccagc 
caatgacagt 
atgttggcac 
gttagaacca 
ggaagacctc 
ggatgatcct 
ctgtacggag 
acgccggacg 
gcgtctccgg 
caaagcatcc 
cctgataaca 
cgagtcacac 
aaagattatc 
cgtggtcaaa 
cttccaaaag 
gaatctcctc 
accttctcaa 
gctccagagc 
ttcctcacag 
taggtccctt 
cacttccccg 
agctatgcta 
gcaaggtgct 
catgctcatg 
ttctctcaaa 
ttaaacatta 
tacggccatg 
cagctcctcg 
catcggtcag 
gagacaacat 
aaacacctga 
acagcgcttc 
taaaaaaaca 
gtgcagagcg 
acccagaaaa 
ctcaaatcgt 
acaattccca 
acgccccgcg 
taaggtatat 



ggcggtggtg 
aatacaacat 
ttgtcctccg 
acagcaccac 
cggggaccac 
ccctcataaa 
cccggtacca 
tggccaaaac 
ggagagccca 
aacacaggca 
tatcccaggg 
gcacgtaact 
ccagtatggt 
tgcgccgaga 
tagtcatatt 
tctcgccgct 
aggcgccccc 
tccaccaccg 
acgggaggag 
caaaacctca 
ctctacagcc 
gcaaacggcc 
tataaacatt 
tatatctcta 
gccctccacc 
acctgtataa 
cgcagggcca 
ccaggaacct 
accagcgtag 
gctcaaaaaa 
cagataaagg 
catgtctgcg 
gaagcctgtc 
ccggcgtgac 
gtcatgtccg 
tgctaaaaag 
tacagccccc 
aaaaccctcc 
cacagcggca 
ccactcgaca 
agtatatata 
ccgcacgcga 
cacttccgtt 
acacatacaa 
ccacgtcaca 
tattgatgat 



ctgcagcagc 
ggcagtggtc 
ggcacagcag 
aatattgttc 
agaacccacg 
cacgctggac 
tataaacctc 
ctgcccgccg 
ggactcgtaa 
cacgtgcata 
aacaacccat 
cacgttgtgc 
agcgcgggtt 
caaccgagat 
tcctgaagca 
tagatcgctc 
tggcttcggg 
cagaataagc 

cgggaagagc 

aaatgaagat 
aaagaacaga 
ctcacgtcca 
ccagcacctt 
agcaaatccc 
ttcagcctca 
gattcaaaag 
gctgaacata 
tgacaaaaga 
ccccgatgta 
tcaggcaaag 
caggtaagct 
ggtttctgca 
ttacaacagg 
cgtaaaaaaa 
gagtcataat 
cgaccgaaat 
ataggaggta 
tgcctaggca 
gccataacag 
cggcaccagc 
ggactaaaaa 
acctacgccc 
ttcccacgtt 
gttactccgc 
aactccaccc 



32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35211 



<210> 52 
<211> 33622 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid Av3nBg 



<400> 52 

catcatcaat 

ttgtgacgtg 

gatgttgcaa 

gtgtgcgccg 

taaatttggg 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 



ggggtggagt 
gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 



60 

120 

180 

240 

300 
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agtgaaatct 
gactttgacc 
acgcgtatct 
gttaggagtc 
cttatgcaat 
aaggagagaa 
tattaggaag 
cagagatatt 
cattggtgtg 
ctgttgggct 
aacccgtcgg 
cggatcggaa 
gatcccccgg 
gtagaagacc 
gtcgttttac 
gcacatcccc 
caacagttgc 
gtgccggaaa 
aactggcaga 
gtcaatccgc 
gttgatgaaa 
gcgtttcatc 
tctgaatttg 
ctgcgttgga 
ttccgtgacg 
actcgcttta 
gagttgcgtg 
agcggcaccg 
gtcacactac 
tatcgtgcgg 
gatgtcggtt 
ttgctgattc 
gagcagacga 
tgttcgcatt 
gtggtggatg 
gatccgcgct 
aatcacccga 
gacgcgctgt 
ggcggagccg 
gaccagccct 
gagacgcgcc 
ttcgctaaat 
gactgggtgg 
ggcggtgatt 
gccgaccgca 
ttccgtttat 
aacgagctcc 
cctctggatg 
gagagcgccg 
tcagaagccg 
acgctccccg 
atcgagctgg 
tggattggcg 
ctggataacg 
cgctggaagg 
acacttgctg 
ttatttatca 
gatgttgaag 
gcgcaggtag 
cgccttactg 
tacgtcttcc 



gaataatttt 
gtttacgtgg 
gaggggacta 
ccctcaggat 
actcttgtag 
aaagcaccgt 
gcaacagacg 
gtatttaagt 
cacctccggc 
cgcggttgag 
cctccgaacg 
aacctctcga 
gctgcaggaa 
ccaaggactt 
aacgtcgtga 
ctttcgccag 
gcagcctgaa 
gctggctgga 
tgcacggtta 
cgtttgttcc 
gctggctaca 
tgtggtgcaa 
acctgagcgc 
gtgacggcag 
tctcgttgct 
atgatgattt 
actacctacg 
cgcctttcgg 
gtctgaacgt 
tggttgaact 
tccgcgaggt 
gaggcgttaa 
tggtgcagga 
atccgaacca 
aagccaatat 
ggctaccggc 
gtgtgatcat 
atcgctggat 
acaccacggc 
tcccggctgt 
cgctgatcct 
actggcaggc 
atcagtcgct 
ttggcgatac 
cgccgcatcc 
ccgggcaaac 
tgcactggat 
tcgctccaca 
ggcaactctg 
ggcacatcag 
ccgcgtccca 
gtaataagcg 
ataaaaaaca 
acattggcgt 
cggcgggcca 
atgcggtgct 
gccggaaaac 
tggcgagcga 
cagagcgggt 
ccgcctgttt 
cgagcgaaaa 



gtgttactca 
agactcgccc 
gggtgtgttt 
atagtagttt 
tcttgcaaca 
gcatgccgat 
ggtctgacat 
gcctagctcg 
cctggccact 
gacaaactct 
gtactccgcc 
gaaaggcgtg 
ttcgatctag 
tccttcagaa 
ctgggaaaac 
ctggcgtaat 
tggcgaatgg 
gtgcgatctt 
cgatgcgccc 
cacggagaat 
ggaaggccag 
cgggcgctgg 
atttttacgc 
ttatctggaa 
gcataaaccg 
cagccgcgct 
ggtaacagtt 
cggtgaaatt 
cgaaaacccg 
gcacaccgcc 
gcggattgaa 
ccgtcacgag 
tatcctgctg 
tccgctgtgg 
tgaaacccac 
gatgagcgaa 
ctggtcgctg 
caaatctgtc 
caccgatatt 
gccgaaatgg 
ttgcgaatac 
gtttcgtcag 
gattaaatat 
gccgaacgat 
agcgctgacg 
catcgaagtg 
ggtggcgctg 
aggtaaacag 
gctcacagta 
cgcctggcag 
cgccatcccg 
ttggcaattt 
actgctgacg 
aagtgaagcg 
ttaccaggcc 
gattacgacc 
ctaccggatt 
tacaccgcat 
aaactggctc 
tgaccgctgg 
cggtctgcgc 



tagcgcgtaa 
agggcgcgcc 
aggcgaaaag 
cgcttttgca 
tggtaacgat 
tggtggaagt 
ggattggacg 
atacaataaa 
ctcttccgca 
tcgcggtctt 
gccgagggac 
taaccagtca 
atggataaag 
ttgctaagtt 
cctggcgtta 
agcgaagagg 
cgctttgcct 
cctgaggccg 
atctacacca 
ccgacgggtt 
acgcgaatta 
gtcggttacg 
gccggagaaa 
gatcaggata 
actacacaaa 
gtactggagg 
tctttatggc 
atcgatgagc 
aaactgtgga 
gacggcacgc 
aatggtctgc 
catcatcctc 
atgaagcaga 
tacacgctgt 
ggcatggtgc 
cgcgtaacgc 
gggaatgaat 
gatccttccc 
atttgcccga 
tccatcaaaa 
gcccacgcga 
tatccccgtt 
gatgaaaacg 
cgccagttct 
gaagcaaaac 
accagcgaat 
gatggtaagc 
ttgattgaac 
cgcgtagtgc 
cagtggcgtc 
catctgacca 
aaccgccagt 
ccgctgcgcg 
acccgcattg 
gaagcagcgt 
gctcacgcgt 
gatggtagtg 
ccggcgcgga 
ggattagggc 
gatctgccat 
tgcgggacgc 



tatttgtcta 
ccgatgtacg 
cggggcttcg 
tagggagggg 
gagttagcaa 
aaggtggtac 
aaccactgaa 
cgccatttga 
tcgctgtctg 
tccagtactc 
ctgagcgagt 
cagtcgctct 
gtccaaaaaa 
ttttgagtga 
cccaacttaa 
cccgcaccga 
ggtttccggc 
atactgtcgt 
acgtaaccta 
gttactcgct 
tttttgatgg 
gccaggacag 
accgcctcgc 
tgtggcggat 
tcagcgattt 
ctgaagttca 
agggtgaaac 
gtggtggtta 
gcgccgaaat 
tgattgaagc 
tgctgctgaa 
tgcatggtca 
acaactttaa 
gcgaccgcta 
caatgaatcg 
gaatggtgca 
caggccacgg 
gcccggtgca 
tgtacgcgcg 
aatggctttc 
tgggtaacag 
tacagggcgg 
gcaacccgtg 
gtatgaacgg 
accagcagca 
acctgttccg 
cgctggcaag 
tgcctgaact 
aaccgaacgc 
tggcggaaaa 
ccagcgaaat 
caggctttct 
atcagttcac 
accctaacgc 
tgttgcagtg 
ggcagcatca 
gtcaaatggc 
ttggcctgaa 
cgcaagaaaa 
tgtcagacat 
gcgaattgaa 



gggccgcggg 
ggccagatat 
gttgtacgcg 
gaaatgtagt 
catgccttac 
gatcgtgcct 
ttccgcattg 
ccattcacca 
cgggggccag 
ttggatcgga 
ccgcatcgac 
agaactagtg 
gaagagaaag 
ttcactggcc 
tcgccttgca 
tcgcccttcc 
accagaagcg 
cgtcccctca 
tcccattacg 
cacatttaat 
cgttaactcg 
tcgtttgccg 
ggtgatggtg 
gagcggcatt 
ccatgttgcc 
gatgtgcggc 
gcaggtcgcc 
tgccgatcgc 
cccgaatctc 
agaagcctgc 
cggcaagccg 
ggtcatggat 
cgccgtgcgc 
cggcctgtat 
tctgaccgat 
gcgcgatcgt 
cgctaatcac 
gtatgaaggc 
cgtggatgaa 
gctacctgga 
tcttggcggt 
cttcgtctgg 
gtcggcttac 
tctggtcttt 
gtttttccag 
tcatagcgat 
cggtgaagtg 
accgcagccg 
gaccgcatgg 
cctcagtgtg 
ggatttttgc 
ttcacagatg 
ccgtgcaccg 
ctgggtcgaa 
cacggcagat 
ggggaaaacc 
gattaccgtt 
ctgccagctg 
ctatcccgac 
gtataccccg 
ttatggccca 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



WO 2004/099422 



PCT/US2004/009219 



-90- 



caccagtggc gcggcgactt ccagttcaac 
gaaaccagcc atcgccatct gctgcacgcg 
ttccatatgg ggattggtgg cgacgactcc 
ctgagcgccg gtcgctacca ttaccagttg 
aagcttatcg ataccgtcga aacttgttta 
atagcatcac aaatttcaca aataaagcat 
ccaaactcat caatgtatct tatcatgtct 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 



atcagccgct acagtcaaca gcaactgatg 4 02 0 
gaagaaggca catggctgaa tatcgacggt 40 80 
tggagcccgt cagtatcggc ggaatttcag 4140 
gtctggtgtc aaaaataata atctcgaatc 4200 
ttgcagctta taatggttac aaataaagca 4260 
ttttttcact gcattctagt tgtggtttgt 432 0 
ggatccgacc tcggatctgg aaggtgctga 43 8 0 
cctgcgagtg tggcggtaaa catattagga 444 0 
agctgaggcc cgatcacttg gtgctggcct 450 0 
aagatacaga ttgaggtact gaaatgtgtg 4560 
aaggtggggg tcttatgtag ttttgtatct 4 62 0 
ccaactcgtt tgatggaagc attgtgagct 4680 
ccggggtgcg tcagaatgtg atgggctcca 4 74 0 
actctactac cttgacctac gagaccgtgt 480 0 
ccgccgcttc agccgctgca gccaccgccc 4 86 0 
gcccgcttgc aagcagtgca gcttcccgtt 4 92 0 
ttttggcaca attggattct ttgacccggg 4980 
atctgcgcca gcaggtttct gccctgaagg 5 04 0 
taaataaaaa accagactct gtttggattt 510 0 
taggggtttt gcgcgcgcgg taggcccggg 5160 
gtattttttc caggacgtgg taaaggtgac 5220 
cgtctctggg gtggaggtag caccactgca 52 8 0 
tgatccagtc gtagcaggag cgctgggcgt 5340 
tgattgccag gggcaggccc ttggtgtaag 54 0 0 
gcatacgtgg ggatatgaga tgcatcttgg 54 60 
ccatatccct ccggggattc atgttgtgca 552 0 
tgggaaattt gtcatgtagc ttagaaggaa 55 8 0 
gacctccaag attttccatg cattcgtcca 5640 
cctgggcgaa gatatttctg ggatcactaa 5700 
cataggccat ttttacaaag cgcgggcgga 5760 
ccggcccagg ggcgtagtta ccctcacaga 5 82 0 
gggggatcat gtctacctgc ggggcgatga 5 88 0 
gctgggaaga aagcaggttc ctgagcagct 594 0 
tcacacctat taccggctgc aactggtagt 6 00 0 
gcaggggggc cacttcgtta agcatgtccc 6 060 
ccagaaggcg ctcgccgccc agcgatagca 612 0 
gtttgagacc gtccgccgta ggcatgcttt 6180 
cccacagctc ggtcacctgc tctacggcat 6240 
gttggggcgg ctttcgctgt acggcagtag 63 0 0 
gtctttccac gggcgcaggg tcctcgtcag 63 60 
tccgggctgc gcgctggcca gggtgcgctt 642 0 
ccggtcttcg ccctgcgcgt cggccaggta 6480 
ctccgcggcg tggcccttgg cgcgcagctt 6540 
gtgcagactt ttgagggcgt agagcttggg 660 0 
atccgcgccg caggccccgc agacggtctc 6 660 
ttcggggtca aaaaccaggt ttcccccatg 6720 
catgagccgg tgtccacgct cggtgacgaa 6 780 
gagaggcctg tcctcgagcg gtgttccgcg 6 84 0 
tgagacaaag gctcgcgtcc aggccagcac 6 90 0 
gttgtccact agggggtcca ctcgctccag 6960 
atcaaggaag gtgattggtt tgtaggtgta 7 02 0 
gctataaaag ggggtggggg cgcgttcgtc 70 8 0 
ggccagctgt tggggtgagt actccctctg 7140 
gtcagtttcc aaaaacgagg aggatttgat 72 00 
ggtggccgca tccatctggt cagaaaagac 72 6 0 
cgacccgtag agggcgttgg acagcaactt 732 0 
gcgatcggcg cgctccttgg ccgcgatgtt 73 80 
ccattcggga aagacggtgg tgcgctcgtc 744 0 
gtgcagggtg acaaggtcaa cgctggtggc 7500 
gcagaggcgg ccgcccttgc gcgagcagaa 75 60 
cggggggtct gcgtccacgg taaagacccc 7620 
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gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 7680 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 774 0 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 780 0 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 7860 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 792 0 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 7 98 0 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 8 04 0 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 810 0 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 8160 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 822 0 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 82 8 0 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 834 0 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 840 0 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 8460 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 852 0 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 8580 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 864 0 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 8700 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 8760 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 8 82 0 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 8 880 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 894 0 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 9000 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 90 60 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 912 0 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 9180 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 9240 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 93 0 0 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 93 60 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 942 0 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 94 80 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 9540 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 9600 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 9660 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 9720 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 9780 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 9840 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 9900 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 9960 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 10020 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 10 080 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 1014 0 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 10200 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 10260 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 1032 0 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 103 80 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 10440 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 10500 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 10560 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 10620 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 10680 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 1074 0 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 10 80 0 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 10860 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 10920 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 10980 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 11040 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 11100 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 11160 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 11220 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 112 80 
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tggtga-cccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 1134 0 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 11400 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 114 60 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 1152 0 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 11580 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagac cgtgcaaaag 11640 
gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa gggtatcatg 11700 
gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat gcggttaccg 1176 0 
cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc ttttggcttc 1182 0 
cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc gcagcgtaag 11880 
cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg agggttattt 1194 0 
tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accggccgga ctgcggcgaa 12000 
cgggggtttg cctccccgtc atgcaagacc ccgcttgcaa attcctccgg aaacagggac 12 0 60 
gagccccttt tttgcttttc ccagatgcat ccggtgctgc ggcagatgcg cccccctcct 1212 0 
cagcagcggc aagagcaaga gcagcggcag acatgcaggg caccctcccc tcctccta.cc 1218 0 
gcgtcaggag gggcgacatc cgcggttgac gcggcagcag atggtgatta cgaacccccg 12240 
cggcgccggg cccggcacta cctggacttg gaggagggcg agggcctggc gcggctagga 123 00 
gcgccctctc ctgagcggta cccaagggtg cagctgaagc gtgatacgcg tgaggcgtac 12360 
gtgccgcggc agaacctgtt tcgcgaccgc gagggagagg agcccgagga gatgcgggat 1242 0 
cgaaagttcc acgcagggcg cgagctgcgg catggcctga atcgcgagcg gttgctgcgc 1248 0 
gaggaggact ttgagcccga cgcgcgaacc gggattagtc ccgcgcgcgc acacgtggcg 1254 0 
gccgccgacc tggtaaccgc atacgagcag acggtgaacc aggagattaa ctttcaaaaa 12600 
agctttaaca accacgtgcg tacgcttgtg gcgcgcgagg aggtggctat aggactgatg 1266 0 
catctgtggg actttgtaag cgcgctggag caaaacccaa atagcaagcc gctcatggcg 12720 
cagctgttcc ttatagtgca gcacagcagg gacaacgagg cattcaggga tgcgctgcta 12780 
aacatagtag agcccgaggg ccgctggctg ctcgatttga taaacatcct gcagagcata 12 84 0 
gtggtgcagg agcgcagctt gagcctggct gacaaggtgg ccgccatcaa ctattccatg 1290 0 
cttagcctgg gcaagtttta cgcccgcaag atataccata ccccttacgt tcccatagac 12960 
aaggaggtaa agatcgaggg gttctacatg cgcatggcgc tgaaggtgct taccttgagc 13020 
gacgacctgg gcgtttatcg caacgagcgc atccacaagg ccgtgagcgt gagccggcgg 13080 
cgcgagctca gcgaccgcga gctgatgcac agcctgcaaa gggccctggc tggcacgggc 1314 0 
agcggcgata gagaggccga gtcctacttt gacgcgggcg ctgacctgcg ctgggcccca 132 00 
agccgacgcg ccctggaggc agctggggcc ggacctgggc tggcggtggc acccgcgcgc 13260 
gctggcaacg tcggcggcgt ggaggaatat gacgaggacg atgagtacga gccagaggac 13320 
ggcgagtact aagcggtgat gtttctgatc agatgatgca agacgcaacg gacccggcgg 133 80 
tgcgggcggc gctgcagagc cagccgtccg gccttaactc cacggacgac tggcgccagg 13440 
tcatggaccg catcatgtcg ctgactgcgc gcaatcctga cgcgttccgg cagcagccgc 13500 
aggccaaccg gctctccgca attctggaag cggtggtccc ggcgcgcgca aaccccacgc 135 60 
acgagaaggt gctggcgatc gtaaacgcgc tggccgaaaa cagggccatc cggcccgacg 13620 
aggccggcct ggtctacgac gcgctgcttc agcgcgtggc tcgttacaac agcggcaacg 13680 
tgcagaccaa cctggaccgg ctggtggggg atgtgcgcga ggccgtggcg cagcgtgagc 13740 
gcgcgcagca gcagggcaac ctgggctcca tggttgcact aaacgccttc ctgagtacac 13 80 0 
agcccgccaa cgtgccgcgg ggacaggagg actacaccaa ctttgtgagc gcactgcggc 13860 
taatggtgac tgagacaccg caaagtgagg tgtaccagtc tgggccagac tattttttcc 13 92 0 
agaccagtag acaaggcctg cagaccgtaa acctgagcca ggctttcaaa aacttgcagg 13 980 
ggctgtgggg ggtgcgggct cccacaggcg accgcgcgac cgtgtctagc ttgctgacgc 14 040 
ccaactcgcg cctgttgctg ctgctaatag cgcccttcac ggacagtggc agcgtgtccc 14100 
gggacacata cctaggtcac ttgctgacac tgtaccgcga ggccataggt caggcgcatg 14160 
tggacgagca tactttccag gagattacaa gtgtcagccg cgcgctgggg caggaggaca 14220 
cgggcagcct ggaggcaacc ctaaactacc tgctgaccaa ccggcggcag aagatcccct 142 80 
cgttgcacag tttaaacagc gaggaggagc gcattttgcg ctacgtgcag cagagcgtga 1434 0 
gccttaacct gatgcgcgac ggggtaacgc ccagcgtggc gctggacatg accgcgcgca 1440 0 
acatggaacc gggcatgtat gcctcaaacc ggccgtttat caaccgccta atggactact 14460 
tgcatcgcgc ggccgccgtg aaccccgagt atttcaccaa tgccatcttg aacccgcact 1452 0 
ggctaccgcc ccctggtttc tacaccgggg gattcgaggt gcccgagggt aacgatggat 1458 0 
tcctctggga cgacatagac gacagcgtgt tttccccgca accgcagacc ctgctagagt 14640 
tgcaacagcg cgagcaggca gaggcggcgc tgcgaaagga aagcttccgc aggccaagca 1470 0 
gcttgtccga tctaggcgct gcggccccgc ggtcagatgc tagtagccca tttccaagct 14760 
tgatagggtc tcttaccagc actcgcacca cccgcccgcg cctgctgggc gaggaggagt 14820 
acctaaacaa ctcgctgctg cagccgcagc gcgaaaaaaa cctgcctccg gcatttccca 14880 
acaacgggat agagagccta gtggacaaga tgagtagatg gaagacgtac gcgcaggagc 14 94 0 
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acagggacgt 
gtctggtgtg 
gtggcaaccc 
gcatgatgca 
tccccttagt 
tgtggtgagc 
cccgccgttt 
ctctgagttg 
ggatgtggca 
aaacaatgac 
gcactggggc 
catgtttacc 
tcaggtggag 
gaccatgacc 
acagaacggg 

ggggtttgac 

tccagacatc 
cttgttgggc 
tctggagggt 
agatgacacc 
ggaagagaac 
tgccattcgc 
agcggccgaa 
ggtgatcaaa 
cagcaccttc 
cggaatccgc 
ctactggtcg 
cagcaacttt 
cgaccaggcc 
tcgctttccc 
tgaaaacgtt 
agtccagcga 
cctgggcata 
ccttatatcg 
cggggccaag 
ctggggcgcg 
ggtggtggag 
ggccattcag 
gcgcgtagca 
cctgcttaac 
ggccgcgggt 
cgcggccatt 
ggttagcggc 
aaactactta 
gtccaagcgc 
cccgaagaag 
gaaagatgat 
gcgacgggta 
agtctttacg 
cggcgacgag 
gcggcataag 
gcccgtaaca 
aaagcgcgag 
actggaagat 
gcggccaatc 
cactaccagt 
ggttgcctca 
ctctacggag 
cggttcgagg 
cattgcgcct 
ccgacgccga 



gccaggcccg 
ggaggacgat 
gtttgcgcac 
aaataaaaaa 
atgcggcgcg 
gcggcgccag 
gtgcctccgc 
gcacccctat 
tccctgaact 
tacagcccgg 
ggcgacctga 
aataagttta 
ctgaaatacg 
atagacctta 
gttctggaaa 
cccgtcactg 
attttgctgc 
atccgcaagc 
ggtaacattc 
gaacagggcg 
tccaacgcgg 
ggcgacacct 
gctgccgccc 
cccctgacag 
acccagtacc 
tcatggaccc 
ttgccagaca 
ccggtggtgg 
gtctactccc 
gagaaccaga 
cctgctctca 
gtgaccatta 
gtctcgccgc 
cccagcaata 
aagcgctccg 
cacaaacgcg 
gaggcgcgca 
accgtggtgc 
cgtcgccacc 
cgcgcacgtc 
attgtcactg 
agtgctatga 
ctgcgcgtgc 
gactcgtact 
aaaatcaaag 
gaagagcagg 
gatgatgaac 
cagtggaaag 
cccggtgagc 
gacctgcttg 
gacatgctgg 
ctgcagcagg 
tctggtgact 
gtcttggaaa 
aagcaggtgg 
agcaccagta 
gcggtggcgg 
gtgcaaacgg 
aagtacggcg 
acccccggct 
accaccactg 



cgcccgccca 
gactcggcag 
cttcgcccca 
ctcaccaagg 
cggcgatgta 
tggcggcggc 
ggtacctgcg 
tcgacaccac 
accagaacga 
gggaggcaag 
aaaccatcct 
aggcgcgggt 
agtgggtgga 
tgaacaacgc 
gcgacatcgg 
gtcttgtcat 
caggatgcgg 
ggcaaccctt 
ccgcactgtt 

ggggtggcgc 

cagccgcggc 
ttgccacacg 
ccgctgcgca 
aggacagcaa 
gcagctggta 
tgctttgcac 
tgatgcaaga 
gcgccgagct 
aactcatccg 
ttttggcgcg 
cagatcacgg 
ctgacgccag 
gcgtcctatc 
acacaggctg 
accaacaccc 
gccgcactgg 
actacacgcc 
gcggagcccg 
gccgccgacc 
gcaccggccg 
tgccccccag 
ctcagggtcg 
ccgtgcgcac 
gttgtatgta 
aagagatgct 
attacaagcc 
ttgacgacga 
gtcgacgcgt 
gctccacccg 
agcaggccaa 
cgttgccgct 
tgctgcccgc 
tggcacccac 
aaatgaccgt 
cgccgggact 
ttgccaccgc 
atgccgcggt 
acccgtggat 
ccgccagcgc 
atcgtggcta 
gaacccgccg 



cccgtcgtca 
acgacagcag 
ggctggggag 
ccatggcacc 
tgaggaaggt 
gctgggttct 
gcctaccggg 
ccgtgtgtac 
ccacagcaac 
cacacagacc 
gcataccaac 
gatggtgtcg 
gttcacgctg 
gatcgtggag 
ggtaaagttt 
gcctggggta 
ggtggacttc 
ccaggagggc 
ggatgtggac 
aggcggcagc 
aatgcagccg 
ggctgaggag 
acccgaggtc 
gaaacgcagt 
ccttgcatac 
tcctgacgta 
ccccgtgacc 
gttgcccgtg 
ccagtttacc 
cccgccagcc 
gacgctaccg 
acgccgcacc 
gagccgcact 
gggcctgcgc 
agtgcgcgtg 
gcgcaccacc 
cacgccgcca 
gcgctatgct 
cggcactgcc 
acgggcggcc 
gtccaggcga 
caggggcaac 
ccgccccccg 
tccagcggcg 
ccaggtcatc 
ccgaaagcta 
ggtggaactg 
aaaacgtgtt 
cacctacaag 
cgagcgcctc 
ggacgagggc 
gcttgcaccg 
cgtgcagctg 
ggaacctggg 

gggcgtgcag 

cacagagggc 
gcaggcggtc 
gtttcgcgtt 
gctactgccc 
cacctaccgc 
ccgccgtcgc 



aaggcacgac 
cgtcctggat 
aatgttttaa 
gagcgttggt 
cctcctccct 
cccttcgatg 
gggagaaaca 
ctggtggaca 
tttctgacca 
atcaatcttg 
afcgccaaatg 
cgcttgccta 
cccgagggca 
cactacttga 
gacacccgca 
tatacaaacg 
acccacagcc 
tttaggatca 
gcctaccagg 
aacagcagtg 
gtggaggaca 
aagcgcgctg 
gagaagcctc 
tacaacctaa 
aactacggcg 
acctgcggct 
ttccgctcca 
cactccaaga 
tctctgaccc 
cccaccatca 
ctgcgcaaca 
tgcccctacg 
ttttgagcaa 
ttcccaagca 
cgcgggcact 
gtcgatgacg 
ccagtgtcca 
aaaatgaaga 
gcccaacgcg 
atgcgggccg 
cgagcggccg 
gtgtattggg 
cgcaactaga 
gcggcgcgca 
gcgccggaga 
aagcgggtca 
ctgcacgcta 
ttgcgacccg 
cgcgtgtatg 
ggggagtttg 
aacccaacac 
tccgaagaaa 
atggtaccca 
ctggagcccg 
accgtggacg 
atggagacac 
gctgcggccg 
tcagcccccc 
gaatatgccc 
cccagaagac 
cgtcgccagc 



cgtcagcggg 
ttgggaggga 
aaaaaaaaaa 
tttcttgtat 
cctacgagag 
ctcccctgga 
gcatccgtta 
acaagtcaac 
cggtcattca 
acgaccggtc 
tgaacgagtt 
ctaaggacaa 
actactccga 
aagtgggcag 
acttcagact 
aagccttcca 
gcctgagcaa 
cctacgatga 
cgagcttgaa 
gcagcggcgc 
tgaacgatca 
aggccgaagc 
agaagaaacc 
taagcaatga 
accctcagac 
cggagcaggt 
cgcgccagat 
gcttctacaa 
acgtgttcaa 
ccaccgtcag 
gcatcggagg 
tttacaaggc 
gcatgtccat 
agatgtttgg 
accgcgcgcc 
ccatcgacgc 
cagtggacgc 
gacggcggag 
cggcggcggc 
ctcgaaggct 
ccgcagcagc 
tgcgcgactc 
ttgcaagaaa 
acgaagctat 
tctatggccc 
aaaagaaaaa 
ccgcgcccag 
gcaccaccgt 
atgaggtgta 
cctacggaaa 
ctagcctaaa 
agcgcggcct 
agcgccagcg 
aggtccgcgt 
ttcagatacc 
aaacgtcccc 
cgtccaagac 
ggcgcccgcg 
tacatccttc 
gagcaactac 
ccgtgctggc 



15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
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cccgatttcc 
gcgctaccac 
cacctgccgc 
catggccggc 
gtcgcaccgt 
gattggcgcc 
aacaagttgc 
tgtaactatt 
cgcgcccgtt 
tcagctgggg 
gcagcaaggc 
atttccaaca 
ccaaccaggc 
agcctccacc 
ccgacaggga 
taaagcaagg 
agcacacacc 
tgccaggccc 
ccagcggtcc 
gcatcgtggg 
tgtcgtatgt 
gcgcccgctt 
ctcgggccag 
cgagacgtac 
cgtgaccaca 
tactgcgtac 
catggcttcc 
ctactctggc 
ggatgaagct 
cgaagtagac 
tggtataaat 
tgccgataaa 
aattaatcat 
ttcatatgca 
tggaaagcta 
caatggtgat 
cccagacact 
gggccaacaa 
tctaatgtat 
gaatgctgtt 
ttccattggt 
tccagatgtt 
tccactggga 
ggaaaatgga 
aaataatttt 
caacatagcg 
tgataaccca 
ctgctacatt 
taaccaccac 
tgtgcccttc 
gccgggctca 
ctccctagga 
ttacgccacc 
aaacgacacc 
tatacccgcc 
ccgcggctgg 
cgacccttat 
ccacaccttt 
ccgcctgctt 
cgttgcccag 
cattggctac 



gtgcgcaggg 
cccagcatcg 
ctccgtttcc 
cacggcctga 
cgcatgcgcg 
gtgcccggaa 
atgtggaaaa 
ttgtagaatg 
catgggaaac 
ctcgctgtgg 
ctggaacagc 
aaaggtggta 
agtgcaaaat 
ggccgtggag 
agaaactctg 
cctgcccacc 
cgtaacgctg 
gaccgccgtt 
gcgatcgttg 
tctgggggtg 
gtgtcatgta 
tccaagatgg 
gacgcctcgg 
ttcagcctga 
gaccggtccc 
tcgtacaagg 
acgtactttg 
actgcctaca 
gctactgctc 
gagcaagctg 
attacaaagg 
acatttcaac 
gcagctggga 
aaacccacaa 
gaaagtcaag 
aacttgactc 
catatttctt 
tctatgccca 
tacaacagca 
gtagatttgc 
gatagaacca 
agaattattg 
ggtgtgatta 
tgggaaaaag 
gccatggaaa 
ctgtatttgc 
aacacctacg 
aaccttggag 
cgcaatgctg 
cacatccagg 
tacacctacg 
aatgacctaa 
ttcttcccca 
aacgaccagt 
aacgctacca 
gccttcacgc 
tacacctact 
aagaaggtgg 
acccccaacg 
tgtaacatga 
cagggcttct 



tggctcgcga 
tttaaaagcc 
cggtgccggg 
cgggcggcat 
gcggtatcct 
ttgcatccgt 
atcaaaataa 
gaagacatca 
tggcaagata 
agcggcatta 
agcacaggcc 
gatggcctgg 
aagattaaca 
acagtgtctc 
gtgacgcaaa 
acccgtccca 
gacctgcctc 
gttgtaaccc 
cggcccgtag 
caatccctga 
tgcgtccatg 
ctaccccttc 
agtacctgag 
ataacaagtt 
agcgtttgac 
cgcggttcac 
acatccgcgg 
acgccctggc 
ttgaaataaa 
agcagcaaaa 

agggtattca 

ctgaacctca 
gagtccttaa 
atgaaaatgg 
tggaaatgca 
ctaaagtggt 
acatgcccac 
acaggcctaa 
cgggtaatat 
aagacagaaa 
ggtacttttc 
aaaatcatgg 
atacagagac 
atgctacaga 
tcaatctaaa 
ccgacaagct 
actacatgaa 
cacgctggtc 
gcctgcgcta 
tgcctcagaa 

agtggaactt 

gggttgacgg 
tggcccacaa 
cctttaacga 
acgtgcccat 
gccttaagac 
ctggctctat 
ccattacctt 
agtttgaaat 
ccaaagactg 
atatcccaga 



aggaggcagg 
ggtctttgtg 
attccgagga 
gcgtcgtgcg 
gcccctcctt 
ggccttgcag 
aaagtctgga 
actttgcgtc 
tcggcaccag 
aaaatttcgg 
agatgctgag 
cctctggcat 
gtaagcttga 
cagaggggcg 
tagacgagcc 
tcgcgcccat 
cccccgccga 
gtcctagccg 
ccagtggcaa 
agcgccgacg 
tcgccgccag 
gatgatgccg 
ccccgggctg 
tagaaacccc 
gctgcggttc 
cctagctgtg 
cgtgctggac 
tcccaagggt 
cctagaagaa 
aactcacgta 
aataggtgtc 
aataggagaa 
aaagactacc 
agggcaaggc 
atttttctca 
attgtacagt 
tattaaggaa 
ttacattgct 
gggtgttctg 
cacagagctt 
tatgtggaat 
aactgaagat 
tcttaccaag 
attttcagat 
tgccaacctg 
aaagtacagt 
caagcgagtg 
ccttgactat 
ccgctcaatg 
gttctttgcc 
caggaaggat 
agccagcatt 
caccgcctcc 
ctatctctcc 
atccatcccc 
taaggaaacc 
accctaccta 
tgactcttct 
taagcgctca 
gttcctggta 
gagctacaag 



accctggtgc 
gttcttgcag 
agaatgcacc 
caccaccggc 
attccactga 
gcgcagagac 
ctctcacgct 
tctggccccg 
caatatgagc 
ttccaccgtt 
ggataagttg 
tagcggggtg 
tccccgccct 
tggcgaaaag 
tccctcgtac 
ggctaccgga 
cacccagcag 
cgcgtccctg 
ctggcaaagc 
atgcttctga 
aggagctgct 
cagtggtctt 
gtgcagtttg 
acggtggcgc 
atccctgtgg 
ggtgataacc 
aggggcccta 
gccccaaatc 
gaggacgatg 
tttgggcagg 
gaaggtcaaa 
tctcagtggt 
ccaatgaaac 
attcttgtaa 
actactgagg 
gaagatgtag 
ggtaactcac 
tttagggaca 
gcgggccaag 
tcataccagc 
caggctgttg 
gaacttccaa 
gtaaaaccta 
aaaaatgaaa 
tggagaaatt 
ccttccaacg 

gtggctcccg 

atggacaacg 
ttgctgggca 
attaaaaacc 
gttaacatgg 
aagtttgata 
acgcttgagg 
gccgccaaca 
tcccgcaact 
ccatcactgg 
gatggaacct 
gtcagctggc 
gttgacgggg 
caaatgctag 
gaccgcatgt 



tgccaacagc 
atatggccct 
gtaggagggg 
ggcggcgcgc 
tcgccgcggc 
actgattaaa 
cgcttggtcc 
cgacacggct 
ggtggcgcct 
aagaactatg 
aaagagcaaa 
gtggacctgg 
cccgtagagg 
cgtccgcgcc 
gaggaggcac 
gtgctgggcc 
aaacctgtgc 
cgccgcgccg 
acactgaaca 
atagctaacg 
gagccgccgc 
acatgcacat 
cccgcgccac 
ctacgcacga 
accgtgagga 
gtgtgctgga 
cttttaagcc 
cttgcgaatg 
acaacgaaga 
cgccttattc 
cacctaaata 
acgaaactga 
catgttacgg 
agcaacaaaa 
cgaccgcagg 
atatagaaac 
gagaactaat 
attttattgg 
catcgcagtt 
ttttgcttga 
acagctatga 
attactgctt 
aaacaggtca 
taagagttgg 
tcctgtactc 
taaaaatttc 
ggttagtgga 
tcaacccatt 
atggtcgcta 
tccttctcct 
ttctgcagag 
gcatttgcct 
ccatgcttag 
tgctctaccc 

gggcggcttt 

gctcgggcta 
tttacctcaa 
ctggcaatga 
agggttacaa 
ctaactacaa 
actccttctt 



18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
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tagaaacttc cagcccatga gccgtcaggt ggtggatgat actaaataca aggactacca 2232 0 
acaggtgggc atcctacacc aacacaacaa ctctggattt gttggctacc ttgcccccac 223 80 
catgcgcgaa ggacaggcct accctgctaa cttcccctat ccgcttatag gcaagaccgc 22440 
agttgacagc attacccaga aaaagtttct ttgcgatcgc accctttggc gcatcccatt 2250 0 
ctccagtaac tttatgtcca tgggcgcact cacagacctg ggccaaaacc ttctctacgc 22560 
caactccgcc cacgcgctag acatgacttt tgaggtggat cccatggacg agcccaccct 22 62 0 
tctttatgtt ttgtttgaag tctttgacgt ggtccgtgtg caccggccgc accgcggcgt 22680 
catcgaaacc gtgtacctgc gcacgccctt ctcggccggc aacgccacaa cataaagaag 22 740 
caagcaacat caacaacagc tgccgccatg ggctccagtg agcaggaact gaaagccatt 22 80 0 
gtcaaagatc ttggttgtgg gccatatttt ttgggcacct atgacaagcg ctttccaggc 22 860 
tttgtttctc cacacaagct cgcctgcgcc atagtcaata cggccggtcg cgagactggg 22 92 0 
ggcgtacact ggatggcctt tgcctggaac ccgcactcaa aaacatgcta cctctttgag 22980 
ccctttggct tttctgacca gcgactcaag caggtttacc agtttgagta cgagtcactc 23040 
ctgcgccgta gcgccattgc ttcttccccc gaccgctgta taacgctgga aaagtccacc 2310 0 
caaagcgtac aggggcccaa ctcggccgcc tgtggactat tctgctgcat gtttctccac 23160 
gcctttgcca actggcccca aactcccatg gatcacaacc ccaccatgaa ccttattacc 23220 
ggggtaccca actccatgct caacagtccc caggtacagc ccaccctgcg tcgcaaccag 232 8 0 
gaacagctct acagcttcct ggagcgccac tcgccctact tccgcagcca cagtgcgcag 23 34 0 
attaggagcg ccacttcttt ttgtcacttg aaaaacatgt aaaaataatg tactagagac 23400 
actttcaata aaggcaaatg cttttatttg tacactctcg ggtgattatt tacccccacc 2346 0 
cttgccgtct gcgccgtttg gggaggcggc ggcgacgggg acggggacga cacgtcctcc 23520 
atggttgggg gacgtcgcgc cgcaccgcgt ccgcgctcgg gggtggtttc gcgctgctcc 23580 
tcttcccgac tggccatttc cttctcctat aggcagaaaa agatcatgga gtcagtcgag 23640 
aagaaggaca gcctaaccgc cccctctgag ttcgccacca ccgcctccac cgatgccgcc 23700 
aacgcgccta ccaccttccc cgtcgaggca cccccgcttg aggaggagga agtgattatc 23 760 
gagcaggacc caggttttgt aagcgaagac gacgaggacc gctcagtacc aacagaggat 2 3 820 
aaaaagcaag accaggacaa cgcagaggca aacgaggaac aagtcgggcg gggggacgaa 23 880 
aggcatggcg actacctaga tgtgggagac gacgtgctgt tgaagcatct gcagcgccag 23 940 
tgcgccatta tctgcgacgc gttgcaagag cgcagcgatg tgcccctcgc catagcggat 24000 
gtcagccttg cctacgaacg ccacctattc tcaccgcgcg taccccccaa acgccaagaa 24060 
aacggcacat gcgagcccaa cccgcgcctc aacttctacc ccgtatttgc cgtgccagag 2412 0 
gtgcttgcca cctatcacat ctttttccaa aactgcaaga tacccctatc ctgccgtgcc 24180 
aaccgcagcc gagcggacaa gcagctggcc ttgcggcagg gcgctgtcat acctgatatc 24240 
gcctcgctca acgaagtgcc aaaaatcttt gagggtcttg gacgcgacga gaagcgcgcg 243 0 0 
gcaaacgctc tgcaacagga aaacagcgaa aatgaaagtc actctggagt gttggtggaa 243 60 
ctcgagggtg acaacgcgcg cctagccgta ctaaaacgca gcatcgaggt cacccacttt 24420 
gcctacccgg cacttaacct accccccaag gtcatgagca cagtcatgag tgagctgatc 24480 
gtgcgccgtg cgcagcccct ggagagggat gcaaatttgc aagaacaaac agaggagggc 2454 0 
ctacccgcag ttggcgacga gcagctagcg cgctggcttc aaacgcgcga gcctgccgac 2460 0 
ttggaggagc gacgcaaact aatgatggcc gcagtgctcg ttaccgtgga gcttgagtgc 24 660 
atgcagcggt tctttgctga cccggagatg cagcgcaagc tagaggaaac attgcactac 24720 
acctttcgac agggctacgt acgccaggcc tgcaagatct ccaacgtgga gctctgcaac 247 80 
ctggtctcct accttggaat tttgcacgaa aaccgccttg ggcaaaacgt gcttcattcc 24840 
acgctcaagg gcgaggcgcg ccgcgactac gtccgcgact gcgtttactt atttctatgc 2490 0 
tacacctggc agacggccat gggcgtttgg cagcagtgct tggaggagtg caacctcaag 24960 
gagctgcaga aactgctaaa gcaaaacttg aaggacctat ggacggcctt caacgagcgc 25 02 0 
tccgtggccg cgcacctggc ggacatcatt ttccccgaac gcctgcttaa aaccctgcaa 25080 
cagggtctgc cagacttcac cagtcaaagc atgttgcaga actttaggaa ctttatccta 25140 
gagcgctcag gaatcttgcc cgccacctgc tgtgcacttc ctagcgactt tgtgcccatt 25200 
aagtaccgcg aatgccctcc gccgctttgg ggccactgct accttctgca gctagccaac 25260 
taccttgcct accactctga cataatggaa gacgtgagcg gtgacggtct actggagtgt 2532 0 
cactgtcgct gcaacctatg caccccgcac cgctccctgg tttgcaattc gcagctgctt 253 80 
aacgaaagtc aaattatcgg tacctttgag ctgcagggtc cctcgcctga cgaaaagtcc 25440 
gcggctccgg ggttgaaact cactccgggg ctgtggacgt cggcttacct tcgcaaattt 25500 
gtacctgagg actaccacgc ccacgagatt aggttctacg aagaccaatc ccgcccgcca 25560 
aatgcggagc ttaccgcctg cgtcattacc cagggccaca ttcttggcca attgcaagcc 25620 
atcaacaaag cccgccaaga gtttctgcta cgaaagggac ggggggttta cttggacccc 25680 
cagtccggcg aggagctcaa cccaatcccc ccgccgccgc agccctatca gcagcagccg 25740 
cgggcccttg cttcccagga tggcacccaa aaagaagctg cagctgccgc cgccacccac 2 58 00 
ggacgaggag gaatactggg acagtcaggc agaggaggtt ttggacgagg aggaggagga 25 860 
catgatggaa gactgggaga gcctagacga ggaagcttcc gaggtcgaag aggtgtcaga 25920 
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cgaaacaccg tcaccctcgg tcgcattccc ctcgccggcg ccccagaaat cggcaaccgg 25980 
ttccagcatg gctacaacct ccgctcctca ggcgccgccg gcactgcccg ttcgccgacc 26040 
caaccgtaga tgggacacca ctggaaccag ggccggtaag tccaagcagc cgccgccgtt 2610 0 
agcccaagag caacaacagc gccaaggcta ccgctcatgg cgcgggcaca agaacgccat 26160 
agttgcttgc ttgcaagact gtgggggcaa catctccttc gcccgccgct ttcttctcta 2622 0 
ccatcacggc gtggccttcc cccgtaacat cctgcattac taccgtcatc tctacagccc 26280 
atactgcacc ggcggcagcg gcagcggcag caacagcagc ggccacacag aagcaaaggc 26340 
gaccggatag caagactctg acaaagccca agaaatccac agcggcggca gcagcaggag 26400 
gaggagcgct gcgtctggcg cccaacgaac ccgtatcgac ccgcgagctt agaaacagga 2 6460 
tttttcccac tctgtatgct atatttcaac agagcagggg ccaagaacaa gagctgaaaa 2 652 0 
taaaaaacag gtctctgcga tccctcaccc gcagctgcct gtatcacaaa agcgaagatc 2 6580 
agcttcggcg cacgctggaa gacgcggagg ctctcttcag taaatactgc gcgctgactc 26640 
ttaaggacta gtttcgcgcc ctttctcaaa tttaagcgcg aaaactacgt catctccagc 2670 0 
ggccacaccc ggcgccagca cctgtcgtca gcgccattat gagcaaggaa attcccacgc 26760 
cctacatgtg gagttaccag ccacaaatgg gacttgcggc tggagctgcc caagactact 26820 
caacccgaat aaactacatg agcgcgggac cccacatgat atcccgggtc aacggaatcc 26880 
gcgcccaccg aaaccgaatt ctcttggaac aggcggctat taccaccaca cctcgtaata 26940 
accttaatcc ccgtagttgg cccgctgccc tggtgtacca ggaaagtccc gctcccacca 27000 
ctgtggtact tcccagagac gcccaggccg aagttcagat gactaactca ggggcgcagc 270 60 
ttgcgggcgg ctttcgtcac agggtgcggt cgcccgggca gggtataact cacctgacaa 27120 
tcagagggcg aggtattcag ctcaacgacg agtcggtgag ctcctcgctt ggtctccgtc 27180 
cggacgggac atttcagatc ggcggcgccg gccgtccttc attcacgcct cgtcaggcaa 27240 
tcctaactct gcagacctcg tcctctgagc cgcgctctgg aggcattgga actctgcaat 27300 
ttattgagga gtttgtgcca tcggtctact ttaacccctt ctcgggacct cccggccact 27360 
atccggatca atttattcct aactttgacg cggtaaagga ctcggcggac ggctacgact 2742 0 
gaatgttaag tggagaggca gagcaactgc gcctgaaaca cctggtccac tgtcgccgcc 27480 
acaagtgctt tgcccgcgac tccggtgagt tttgctactt tgaattgccc gaggatcata 27540 
tcgagggccc ggcgcacggc gtccggctta ccgcccaggg agagcttgcc cgtagcctga 2 760 0 
ttcgggagtt tacccagcgc cccctgctag ttgagcggga caggggaccc tgtgttctca 27 66 0 
ctgtgatttg caactgtcct aaccttggat tacatcaaga tctttgttgc catctctgtg 2772 0 
ctgagtataa taaatacaga aattaaaata tactggggct cctatcgcca tcctgtaaac 27780 
gccaccgtct tcacccgccc aagcaaacca aggcgaacct tacctggtac ttttaacatc 27840 
tctccctctg tgatttacaa cagtttcaac ccagacggag tgagtctacg agagaacctc 27900 
tccgagctca gctactccat cagaaaaaac accaccctcc ttacctgccg ggaacgtacg 27960 
agtgcgtcac cggccgctgc accacaccta ccgcctgacc gtaaaccaga ctttttccgg 2 8 020 
acagacctca ataactctgt ttaccagaac aggaggtgag cttagaaaac ccttagggta 2 8 080 
ttaggccaaa ggcgcagcta ctgtggggtt tatgaacaat tcaagcaact ctacgggcta 2 814 0 
ttctaattca ggtttctcta gaaatggacg gaattattac agagcagcgc ctgctagaaa 2 82 00 
gacgcagggc agcggccgag caacagcgca tgaatcaaga gctccaagac atggttaact 28260 
tgcaccagtg caaaaggggt atcttttgtc tggtaaagca ggccaaagtc acctacgaca 2 832 0 
gtaataccac cggacaccgc cttagctaca agttgccaac caagcgtcag aaattggtgg 2 8380 
tcatggtggg agaaaagccc attaccataa ctcagcactc ggtagaaacc gaaggctgca 2 8440 
ttcactcacc ttgtcaagga cctgaggatc tctgcaccct tattaagacc ctgtgcggtc 28500 
tcaaagatct tattcccttt aactaataaa aaaaaataat aaagcatcac ttacttaaaa 28560 
tcagttagca aatttctgtc cagtttattc agcagcacct ccttgccctc ctcccagctc 28620 
tggtattgca gcttcctcct ggctgcaaac tttctccaca atctaaatgg aatgtcagtt 28680 
tcctcctgtt cctgtccatc cgcacccact atcttcatgt tgttgcagat gaagcgcgca 2874 0 
agaccgtctg aagatacctt caaccccgtg tatccatatg acacggaaac cggtcctcca 28800 
actgtgcctt ttcttactcc tccctttgta tcccccaatg ggtttcaaga gagtccccct 28860 
ggggtactct ctttgcgcct atccgaacct ctagttacct ccaatggcat gcttgcgctc 28920 
aaaatgggca acggcctctc tctggacgag gccggcaacc ttacctccca aaatgtaacc 28980 
actgtgagcc cacctctcaa aaaaaccaag tcaaacataa acctggaaat atctgcaccc 29040 
ctcacagtta cctcagaagc cctaactgtg gctgccgccg cacctctaat ggtcgcgggc 2 910 0 
aacacactca ccatgcaatc acaggccccg ctaaccgtgc acgactccaa acttagcatt 29160 
gccacccaag gacccctcac agtgtcagaa ggaaagctag ccctgcaaac atcaggcccc 2 922 0 
ctcaccacca ccgatagcag tacccttact atcactgcct caccccctct aactactgcc 29280 
actggtagct tgggcattga cttgaaagag cccatttata cacaaaatgg aaaactagga 29340 
ctaaagtacg gggctccttt gcatgtaaca gacgacctaa acactttgac cgtagcaact 2 9400 
ggtccaggtg tgactattaa taatacttcc ttgcaaacta aagttactgg agccttgggt 29460 
tttgattcac aaggcaatat gcaacttaat gtagcaggag gactaaggat tgattctcaa 2 952 0 
aacagacgcc ttatacttga tgttagttat ccgtttgatg ctcaaaacca actaaatcta 29580 
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agactaggac 
aaaggccttt 
actgccaagg 
gaatttggtt 
ctagaatttg 
agcacaggtg 
ccagctccat 
ttaacaaaat 
ttggctccaa 
ggagtgctac 
cttactgaag 
ccaaaatctc 
gacaaaacta 
acaactccaa 
aatgaaatat 
gtttgtgtta 
attcagtagt 
tcacagaacc 
ttctccccgg 
tatattccac 
gggcagctca 
ttgcggttgc 
atcgtgcatc 
ccgctccgtc 
ccgcagcata 
agcacagtaa 
gtatccaaag 
gtagattaag 
gttgtaattc 
caccatccta 
actggaacaa 
gatatcaatg 
ctcccgcgtt 
actgcaggga 
cagcagcgga 
atccctactg 
aaatggaacg 
cagatctgcg 
actctctcaa 
ccgctgccct 
cgttctgcga 
tattccaaaa 
ccggtggcgt 
acaatggctt 
tcagggtgaa 
tctcgccacc 
aaaatctgct 
attcaggttc 
gatcccgtag 
gcgcggccac 
tactcggagc 
ataaaatgca 
tcgtagtcat 
accatttttc 
aaaacattta 
gacggactac 
ccaccgacag 
gttgattcat 
aggcgtagag 
aacacataaa 
acaacataca 



agggccctct 

acttgtttac 
ggttgatgtt 
cacctaatgc 
attcaaacaa 
ccattacagt 
ctcctaactg 
gtggcagtca 
tatctggaac 
taaacaattc 
gcacagccta 
acggtaaaac 
aacctgtaac 
gtgcatactc 
ttgccacatc 
tgtttcaacg 
atagccccac 
ctagtattca 
ctggccttaa 
acggtttcct 
cttaagttca 
ttaacgggcg 
aggatagggc 
ctgcaggaat 
aggcgccttg 
ctgcagcaca 
ctcatggcgg 

tggcgacccc 

accacctccc 
aaccagctgg 
tgacagtgga 
ttggcacaac 
agaaccatat 
agacctcgca 
tgatcctcca 
tacggagtgc 
ccggacgtag 
tctccggtct 
agcatccagg 
gataacatcc 
gtcacacacg 
gattatccaa 
ggtcaaactc 
ccaaaaggca 
tctcctctat 
ttctcaatat 
ccagagcgcc 
ctcacagacc 
gtcccttcgc 
ttccccgcca 
tatgctaacc 
aggtgctgct 
gctcatgcag 
tctcaaacat 
aacattagaa 
ggccatgccg 
ctcctcggtc 
cggtcagtgc 
acaacattac 
cacctgaaaa 
gcgcttcaca 



ttttataaac 
agcttcaaac 
tgacgctaca 
accaaacaca 
ggctatggtt 
aggaaacaaa 
tagactaaat 
aatacttgct 
agttcaaagt 
cttcctggac 
tacaaacgct 
tgccaaaagt 
actaaccatt 
tatgtcattt 
ctcttacact 
tgtttatttt 
caccacatag 
acctgccacc 
aaagcatcat 
gtcgagccaa 
tgtcgctgtc 
gcgaaggaga 

ggtggtgctg 

acaacatggc 
tcctccgggc 
gcaccacaat 
ggaccacaga 
tcataaacac 
ggtaccatat 
ccaaaacctg 
gagcccagga 
acaggcacac 
cccagggaac 
cgtaactcac 
gtatggtagc 
gccgagacaa 
tcatatttcc 
cgccgcttag 
cgccccctgg 
accaccgcag 
ggaggagcgg 
aacctcaaaa 
tacagccaaa 
aacggccctc 
aaacattcca 
atctctaagc 
ctccaccttc 
tgtataagat 
agggccagct 
ggaaccttga 
agcgtagccc 
caaaaaatca 
ataaaggcag 
gtctgcgggt 
gcctgtctta 
gcgtgaccgt 
atgtccggag 
taaaaagcga 
agcccccata 
accctcctgc 
gcggcagcct 



tcagcccaca 
aattccaaaa 
gccatagcca 
aatcccctca 
cctaaactag 
aataatgata 
gcagagaaag 
acagtttcag 
gctcatctta 
ccagaatatt 
gttggattta 
aacattgtca 
acactaaacg 
tcatgggact 
ttttcataca 
tcaattgcag 
cttatacaga 
tccctcccaa 
atcatgggta 
acgctcatca 
cagctgctga 
agtccacgcc 
cagcagcgcg 
agtggtctcc 
acagcagcgc 
attgttcaaa 
acccacgtgg 
gctggacata 
aaacctctga 
cccgccggct 
ctcgtaacca 
gtgcatacac 
aacccattcc 
gttgtgcatt 
gcgggtttct 
ccgagatcgt 
tgaagcaaaa 
atcgctctgt 
cttcgggttc 
aataagccac 
gaagagctgg 
tgaagatcta 
gaacagataa 
acgtccaagt 
gcaccttcaa 
aaatcccgaa 
agcctcaagc 
tcaaaagcgg 
gaacataatc 
caaaagaacc 
cgatgtaagc 
ggcaaagcct 
gtaagctccg 
ttctgcataa 
caacaggaaa 
aaaaaaactg 
tcataatgta 
ccgaaatagc 
ggaggtataa 
ctaggcaaaa 
aacagtcagc 



acttggatat 
agcttgaggt 
ttaatgcagg 
aaacaaaaat 
gaactggcct 
agctaacttt 
atgctaaact 
ttttggctgt 
ttataagatt 
ggaactttag 
tgcctaacct 
gtcaagttta 
gtacacagga 
ggtctggcca 
ttgcccaaga 
aaaatttcaa 
tcaccgtacc 
cacacagagt 
acagacatat 
gtgatattaa 
gccacaggct 
tacatggggg 
cgaataaact 
tcagcgatga 
accctgatct 
atcccacagt 
ccatcatacc 
aacattacct 
ttaaacatgg 
atacactgca 
tggatcatca 
ttcctcagga 
tgaatcagcg 
gtcaaagtgt 
gtctcaaaag 
gttggtcgta 
ccaggtgcgg 
gtagtagttg 
tatgtaaact 
acccagccaa 
aagaaccatg 
ttaagtgaac 
tggcatttgt 
ggacgtaaag 
ccatgcccaa 
tattaagtcc 
agcgaatcat 
aacattaaca 
gtgcaggtct 
cacactgatt 
tttgttgcat 
cgcgcaaaaa 
gaaccaccac 
acacaaaata 
aacaaccctt 
gtcaccgtga 
agactcggta 
ccgggggaat 
caaaattaat 
tagcaccctc 
cttaccagta 



taactacaac 
taacctaagc 
agatgggctt 
tggccatggc 
tagttttgac 
gtggaccaca 
cactttggtc 
taaaggcagt 
tgacgaaaat 
aaatggagat 
atcagcttat 
cttaaacgga 
aacaggagac 
caactacatt 
ataaagaatc 
gtcatttttc 
ttaatcaaac 
acacagtcct 
tcttaggtgt 
taaactcccc 
gctgtccaac 
tagagtcata 
gctgccgccg 
ttcgcaccgc 
cacttaaatc 
gcaaggcgct 
acaagcgcag 
cttttggcat 
cgccatccac 
gggaaccggg 
tgctcgtcat 
ttacaagctc 
taaatcccac 
tacattcggg 
gaggtagacg 
gtgtcatgcc 
gcgtgacaaa 
tagtatatcc 
ccttcatgcg 
cctacacatt 
tttttttttt 
gcgctcccct 
aagatgttgc 
gctaaaccct 
ataattctca 
ggccattgta 
gattgcaaaa 
aaaataccgc 
gcacggacca 
atgacacgca 
gggcggcgat 
agaaagcaca 
agaaaaagac 
aaataacaaa 
ataagcataa 
ttaaaaagca 
aacacatcag 
acatacccgc 
aggagagaaa 
ccgctccaga 
aaaaagaaaa 
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29700 
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29820 
29880 
29940 
30000 
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30240 
30300 
30360 
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30480 
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30660 
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30780 
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30900 
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31080 
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31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 



WO 2004/099422 



PCT/US2004/009219 



-98- 



cctattaaaa aaacaccact cgacacggca ccagctcaat cagtcacagt gtaaaaaagg 333 0 0 
gccaagtgca gagcgagtat atataggact aaaaaatgac gtaacggtta aagtccacaa 33360 
aaaacaccca gaaaaccgca cgcgaaccta cgcccagaaa cgaaagccaa aaaacccaca 33420 
acttcctcaa atcgtcactt ccgttttccc acgttacgta acttcccatt ttaagaaaac 33480 
tacaattccc aacacataca agttactccg ccctaaaacc tacgtcaccc gccccgttcc 33540 
cacgccccgc gccacgtcac aaactccacc ccctcattat catattggct tcaatccaaa 33 600 
ataaggtata ttattgatga tg 33 622 

<210> 53 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 5FF 



<210> 54 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 5FR 
<400> 54 

tccgcctcca tttagtgaac agttaggaga tggagctggt gtg 43 

<210> 55 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 3FF 



<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 3FR 
<400> 56 

gtggcaggtt gaatactagg 20 

<210> 57 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Penton 1 Oligonucleotide 



<400> 53 

gaacaggagg tgagcttaga 



20 



<400> 55 

tcactaaatg gaggcggaga tgctaaactc actttggtct taac 



44 



<400> 57 
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cgcggaagag aactccaacg cggcagccgc 

<210> 58 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Pent on 2 Oligonucleotide 
<400> 58 

tatcgttcat gtcctccacc ggctgcattg 

<210> 59 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Penton 3 Oligonucleotide 
<400> 59 

cgatagccgc ggctacccct acgacgtgcc 
tgaggagaag cgcgc 

<210> 60 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223 > Penton 4 Oligonucleotide 
<400> 60 

tcagcgcgct tctcctcagc ccgtgtggcg 
gggtagccgc ggc 

<210> 61 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hexon Forward Primer 
<400> 61 

cttcgatgat gccgcagtg 

<210> 62 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hexon Reverse Primer 
<400> 62 

gggctcaggt actccgagg 

<210> 63 
<211> 25 



ggcaatgcag ccggtggagg acatgaa 57 



ccgcggctgc cgcgttggag ttctcttcc 59 



cgactacgcg ggcaccagcg ccacacgggc 60 

75 



ctggtgcccg cgtagtcggg cacgtcgtag 60 

73 



19 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Hexon Probe 
<400> 63 

ttacatgcac atctcgggcc aggac 25 

<210> 64 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5HSPR primer 



<210> 65 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P-0005/U primer 
<400> 65 

ctctagaaat ggacggaatt attacag 27 

<210> 66 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P-0006/L primer 



<210> 67 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P-0007/U primer 
<400> 67 

gttgttgcag atgaccaaga gagtccggct ca 32 

<210> 68 
<211> 73 
<212> DNA 

<213> Artificial Sequence 



<400> 64 

ggctccggct ccgagaggtg ggctcacagt ggttacattt 



40 



<400> 66 

tcttggtcat ctgcaacaac atgaagatag tg 



32 



<220> 

<223> 35FMun primer 
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<400> 68 



agcaattgaa aaataaacac gttgaaacat aacacaaacg attctttagt tgtcgtcttc 6 0 
tgtaatgtaa gaa 73 

<210> 69 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P-0009 primer 



<210> 70 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer PI 
<400> 70 

gaacaggagg tgagcttaga 2 0 

<210> 71 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P2 



<210> 72 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P3 
<400> 72 

gataagctaa ctttgtggac cggaataaac cctccaccta ac 42 

<210> 73 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P4 



<400> 69 

agcaattgaa aaataaacac gttg 



24 



<400> 71 

gttaggtgga gggtttattc cggtccacaa agttagctta tc 



42 



<400> 73 

gtggcaggtt gaatactagg 



20 



<210> 74 
<211> 41 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P5 
<400> 74 

gttaggagat ggagctggtg tagtccataa ggtgttaata c 41 

<210> 75 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P6 
<400> 75 

gtattaacac cttatggact acaccagctc catctcctaa c 41 

<210> 76 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P7 
<400> 76 

tgcgcaaaaa caatcaccac gacaatcaca atgtacattg gaagaaatca tacg . 54 

<210> 77 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer P8 
<400> 77 

acattgtgat tgtcgtggtg attgtttttg cgcatatgcc atacaatttg aatg 54 

<210> 78 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> cRGD peptide 
<400> 78 

His Cys Asp Cys Arg Gly Asp Cys Phe Cys 
1 5 10 



<210> 79 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 



WO 2004/099422 



PCT/US2004/009219 



-103- 



<223> P-0010/L primer 
<400> 79 

ttcttttcat ctgcaacaac atgaagatag fcg 

<210> 80 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> P-0011/U primer 
<400> 80 

gttgttgcag atgaaaagaa ccagaattga ag 

<210> 81 
<211> 73 
<212> DNA 

<213> Artificial Sequence 



32 



32 



<220> 

<223> P-0012/L primer 
<400> 81 

tgcaattgaa aaataaacac gttgaaacat aacacaaacg attctttatt cttcagttat 60 
gtagcaaaat aca 73 

<210> 82 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> 41SRGDR Primer 
<400> 82 

agtacaaaaa caatcaccac gacaatcaca gtttatctcg ttgtagacga cactga 56 

<210> 83 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41SRGDF Primer 
<400> 83 

tgtgattgtc gtggtgattg tttttgtact agtgggtatg cttttacttt t 51 

<210> 84 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> L37 Primer 



<400> 84 

tgtcttggat ccaagatgaa gcgcgcccgc cccagcgaag atgacttc 



48 
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<210> 85 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 37FR Primer 
<400> 85 

aaacacggcg gccgctcttt cattcttg 2 8 

<210> 86 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Native Acl37 N-terminus 
<400> 86 

Met Ser Lys Arg Leu Arg Val Glu 
1 5 



<210> 87 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified Ad37 N-terminus 
<400> 87 

Met Lys Arg Ala Arg Pro Ser Glu 
1 5 



<210> 88 
<211> 1240 
<212> DNA 

<213> Artificial SGqueriCG 
<220> 

<223> Ad5 TPL sequence 



<400> 88 

ggatccactc 

ctgaaaagcg 

gatattcacc 

gacaatcttt 

cttggcgatg 

gtttagctgc 

gtcgggcacc 

ggctacctct 

gaatggcggt 

cccgggcagc 

ctgccatgcg 

ggggtgggtg 

gagtattcca 
gtatagttcg 
tgctcggaag 



tcttccgcat 
ggcatgactt 
tggcccgcgg 
ttgttgtcaa 
gagcgcaggg 
acgtattcgc 
aggtgcacgc 
ccgcgtaggc 
agggggtcta 
aggcgcgcgt 
cgggcggcaa 
agcgcggagg 
agatatgtag 
tgcgagggag 
actatctgcc 



cgctgtctgc 
ctgcgctaag 
tgatgccttt 
gcttggtggc 
tttggttttt 
gcgcaacgca 
gccaaccgcg 
gctcgttggt 
gctgcgtctc 
cgaagtagtc 
gcgcgcgctc 
cgtacatgcc 
ggtagcatct 
cgaggaggtc 
tgaagatggc 



gagggccagc 
attgtcagtt 
gagggtggcc 
aaacgacccg 
gtcgcgatcg 
ccgccattcg 
gttgtgcagg 
ccagcagagg 
gtccgggggg 
tatcttgcat 
gtatgggttg 
gcaaatgtcg 
tccaccgcgg 
gggaccgagg 
atgtgagttg 



tgttggggtg 
tccaaaaacg 
gcatccatct 
tagagggcgt 
gcgcgctcct 
ggaaagacgg 
gtgacaaggt 
cggccgccct 
tctgcgtcca 
ccttgcaagt 
agtgggggac 
taaacgtaga 
atgctggcgc 
ttgctacggg 
gatgatatgg 



agtactccct 60 
aggagga 1 1 1 12 0 
ggtcagaaaa 18 0 
tggacagcaa 24 0 
tggccgcgat 3 00 

tggtgcgctc 3 60 

caacgctggt 42 0 
tgcgcgagca 4 80 
cggtaaagac 54 0 
ctagcgcctg 60 0 
cccatggcat 660 

ggggctctct 720 

gcacgtaatc 7 80 

cgggctgctc 840 

ttggacgctg 90 0 
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gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 

gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 102 0 

ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 1080 

gacaaactct tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 114 0 

agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 12 0 0 

tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct ~ 124 0 

<210> 89 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer F16 5' 

<400> 89 

ccggtctacc catatgaaga tg 22 

<210> 90 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer F16 3' 
<400> 90 

tggtgcggcc gctcagtcat cttctctg 2 8 

<210> 91 
<211> 34 
<212> DNA 

<213> qArtificial Sequence 
<220> 

<223> Primer F35 3' 
<400> 91 

tggtgcggcc gcttagttgt cgtcttctgt aatg 34 

<210> 92 
<211> 10837 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid p5FloxHRF 
<400> 92 

ggaatacaac atggcagtgg tctcctcagc gatgattcgc accgcccgca gcataaggcg 60 
ccttgtcctc cgggcacagc agcgcaccct gatctcactt aaatcagcac agtaactgca 12 0 
gcacagcacc acaatattgt tcaaaatccc acagtgcaag gcgctgtatc caaagctcat 180 
ggcggggacc acagaaccca cgtggccatc ataccacaag cgcaggtaga ttaagtggcg 24 0 
acccctcata aacacgctgg acataaacat tacctctttt ggcatgttgt aattcaccac 3 00 
ctcccggtac catataaacc tctgattaaa catggcgcca tccaccacca tcctaaacca 360 
gctggccaaa acctgcccgc cggctataca ctgcagggaa ccgggactgg aacaatgaca 42 0 
gtggagagcc caggactcgt aaccatggat catcatgctc gtcatgatat caatgttggc 48 0 
acaacacagg cacacgtgca tacacttcct caggattaca agctcctccc gcgttagaac 54 0 
catatcccag ggaacaaccc attcctgaat cagcgtaaat cccacactgc agggaagacc 60 0 
tcgcacgtaa ctcacgttgt gcattgtcaa agtgttacat tcgggcagca gcggatgatc 660 
ctccagtatg gtagcgcggg tttctgtctc aaaaggaggt agacgatccc tactgtacgg 720 
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agtgcgccga 
cgtagtcata 
ggtctcgccg 
ccaggcgccc 
catccaccac 
acacgggagg 
tccaaaacct 
aactctacag 
aggcaaacgg 
tctataaaca 
aatatatctc 
gcgccctcca 
agacctgtat 
ttcgcagggc 
cgccaggaac 
taaccagcgt 
ctgctcaaaa 
tgcagataaa 
aacatgtctg 
tagaagcctg 
tgccggcgtg 
cggtcatgtc 
agtgctaaaa 
attacagccc 
gaaaaaccct 
tcacagcggc 
ccactcgaca 
agtatatata 
ccgcacgcga 
cacttccgtt 
acacatacaa 
ccacgtcaca 
tattgatgat 
tcgccctata 
gaaaaccctg 
cgtaatagcg 
gaatggcgcg 
agcgtgaccg 
tttctcgcca 
ttccgattta 
cgtagtgggc 
tttaatagtg 
tttgatttat 
caaaaattta 
ttcggggaaa 
atccgctcat 
tgagtattca 
tttttgctca 
gagtgggtta 
aagaacgttt 
gtattgacgc 
ttgagtactc 
gcagtgctgc 
gaggaccgaa 
atcgttggga 
ctgtagcaat 
cccggcaaca 
cggcccttcc 
gcggtatcat 
cgacggggag 
cactgattaa 



gacaaccgag 
tttcctgaag 
cttagatcgc 
cctggcttcg 
cgcagaataa 
agcgggaaga 
caaaatgaag 
ccaaagaaca 
ccctcacgtc 
ttccagcacc 
taagcaaatc 
ccttcagcct 
aagattcaaa 
cagctgaaca 
cttgacaaaa 
agccccgatg 
aatcaggcaa 
ggcaggtaag 

c gggtttctg 

tcttacaaca 
accgtaaaaa 
cggagtcata 
agcgaccgaa 
ccataggagg 
cctgcctagg 
agcctaacag 
cggcaccagc 
ggactaaaaa 
acctacgccc 
ttcccacgtt 
gttactccgc 
aactccaccc 
ggatcagctt 
gtgagtcgta 
gcgttaccca 
aagaggcccg 
acgcgccctg 
ctacacttgc 
cgttcgccgg 
gtgctttacg 
catcgccctg 
gactcttgtt 
aagggatttt 
acgcgaattt 
tgtgcgcgga 
gagacaataa 
acatttccgt 
cccagaaacg 
catcgaactg 
tccaatgatg 
cgggcaagag 
accagtcaca 
cataaccatg 
ggagctaacc 
accggagctg 
ggcaacaacg 
attaatagac 
ggctggctgg 
tgcagcactg 
tcaggcaact 
gcattggtaa 



atcgtgttgg 
caaaaccagg 
tctgtgtagt 
ggttctatgt 
gccacaccca 
gctggaagaa 
atctattaag 
gataatggca 
caagtggacg 
ttcaaccatg 
ccgaatatta 
caagcagcga 
agcggaacat 
taatcgtgca 
gaacccacac 
taagctttgt 
agcctcgcgc 
ctccggaacc 
cataaacaca 
ggaaaaacaa 
aactggtcac 
atgtaagact 
atagcccggg 
tataacaaaa 
caaaatagca 
tcagccttac 
tcaatcagtc 
atgacgtaac 
agaaacgaaa 
acgtcacttc 
cctaaaacct 
cctcattatc 
atcgataccg 
ttacaattca 
acttaatcgc 
caccgatcgc 
tagcggcgca 
cagcgcccta 
ctttccccgt 
gcacctcgac 
atagacggtt 
ccaaactgga 
gccgatttcg 
taacaaaata 
acccctattt 
ccctgataaa 
gtcgccctta 
ctggtgaaag 
gatctcaaca 
agcactttta 
caactcggtc 
gaaaagcatc 
agtgataaca 
gcttttttgc 
aatgaagcca 
ttgcgcaaac 
tggatggagg 
tttattgctg 
gggccagatg 
atggatgaac 
ctgtcagacc 



tcgtagtgtc 
tgcgggcgtg 
agttgtagta 
aaactccttc 
gccaacctac 
ccatgttttt 
tgaacgcgct 
tttgtaagat 
taaaggctaa 
cccaaataat 
agtccggcca 
atcatgattg 
taacaaaaat 
ggtctgcacg 
tgattatgac 
tgcatgggcg 
aaaaaagaaa 
accacagaaa 
aaataaaata 
cccttataag 
cgtgattaaa 
cggtaaacac 
ggaatacata 
ttaataggag 
ccctcccgct 
cagtaaaaaa 
acagtgtaaa 
ggttaaagtc 
gccaaaaaac 
ccattttaat 
acgtcacccg 
atattggctt 
tcgacctcga 
ctggccgtcg 
cttgcagcac 
ccttcccaac 
ttaagcgcgg 
gcgcccgctc 
caagctctaa 
cccaaaaaac 
tttcgccctt 
acaacactca 
gcctattggt 
ttaacgttta 
gtttattttt 
tgcttcaata 
ttcccttttt 
taaaagatgc 
gcggtaagat 
aagttctgct 
gccgcataca 
ttacggatgg 
ctgcggccaa 
acaacatggg 
taccaaacga 
tattaactgg 
cggataaagt 
ataaatctgg 
gtaagccctc 
gaaatagaca 
aagtttactc 



atgccaaatg 
acaaacagat 
tatccactct 
atgcgccgct 
acattcgttc 
ttttttattc 
cccctccggt 
gttgcacaat 
acccttcagg 
tctcatctcg 
ttgtaaaaat 
caaaaattca 
accgcgatcc 
gaccagcgcg 
acgcatactc 
gcgatataaa 
gcacatcgta 
aagacaccat 
acaaaaaaac 
cataagacgg 
aagcaccacc 
atcaggttga 
cccgcaggcg 
agaaaaacac 
ccagaacaac 
gaaaacctat 
aaagggccaa 
cacaaaaaac 
ccacaacttc 
taagaaaact 
ccccgttccc 
caatccaaaa 

gggggggccc 

ttttacaacg 
atcccccttt 
agttgcgcag 
cgggtgtggt 
ctttcgcttt 
atcgggggct 
ttgatfcaggg 
tgacgttgga 
accctatctc 
taaaaaatga 
caatttccca 
ctaaatacat 
atattgaaaa 
tgcggcattt 
tgaagatcag 
ccttgagagt 
atgtggcgcg 
ctattctcag 
catgacagta 
cttacttctg 
ggatcatgta 
cgagcgtgac 
cgaactactt 
tgcaggacca 
agccggtgag 
ccgtatcgta 
gatcgctgag 
atatatactt 



gaacgccgga 
ctgcgtctcc 
ctcaaagcat 
gccctgataa 
tgcgagtcac 
caaaagatta 
ggcgtggtca 
ggcttccaaa 
gtgaatctcc 
ccaccttctc 
ctgctccaga 
ggttcctcac 
cgtaggtccc 
gccacttccc 
ggagctatgc 
atgcaaggtg 
gtcatgctca 
ttttctctca 
atttaaacat 
actacggcca 
gacagctcct 
ttcatcggtc 
tagagacaac 
ataaacacct 
atacagcgct 
taaaaaaaca 
gtgcagagcg 
acccagaaaa 
ctcaaatcgt 
acaattccca 
acgccccgcg 
taaggtatat 
ggtacccaat 
tcgtgactgg 
cgccagctgg 
cctgaatggc 
ggttacgcgc 
cttcccttcc 
ccctttaggg 
tgatggttca 
gtccacgttc 
ggtctattct 
gctgatttaa 

ggtggcactt 

tcaaatatgt 
aggaagagta 
tgccttcctg 
ttgggtgcac 
tttcgccccg 
gtattatccc 
aatgacttgg 
agagaattat 
acaacgatcg 
actcgccttg 
accacgatgc 
actctagctt 
cttctgcgct 
cgtgggtctc 
gttatctaca 
ataggtgcct 
tagattgatt 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 
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taaaacttca tttttaattt aaaaggatct 
ccaaaatccc ttaacgtgag ttttcgttcc 
aaggatcttc ttgagatcct ttttttctgc 
caccgctacc agcggtggtt tgtttgccgg 
taactggctt cagcagagcg cagataccaa 
gccaccactt caagaactct gtagcaccgc 
cagtggctgc tgccagtggc gataagtcgt 
taccggataa ggcgcagcgg tcgggctgaa 
agcgaacgac ctacaccgaa ctgagatacc 
ttcccgaagg gagaaaggcg gacaggtatc 
gcacgaggga gcttccaggg ggaaacgcct 
acctctgact tgagcgtcga tttttgtgat 
acgccagcaa cgcggccttt ttacggttcc 
tctttcctgc gttatcccct gattctgtgg 
ataccgctcg ccgcagccga acgaccgagc 
agcgcccaat acgcaaaccg cctctccccg 
acgacaggtt tcccgactgg aaagcgggca 
tcactcatta ggcaccccag gctttacact 
ttgtgagcgg ataacaattt cacacaggaa 
gaattaaccc tcactaaagg gaacaaaagc 
ggatggcacc caaaaagaag ctgcagctgc 
gggacagtca ggcagaggag gttttggacg 
agagcctaga cgaggaagct tccgaggtcg 
cggtcgcatt cccctcgccg gcgccccaga 
cctccgctcc tcaggcgccg ccggcactgc 
ccactggaac cagggccggt aagtccaagc 
agcgccaagg ctaccgctca tggcgcgggc 
actgtggggg caacatctcc ttcgcccgcc 
tcccccgtaa catcctgcat tactaccgtc 
gcggcagcgg cagcaacagc agcggccaca 
ctgacaaagc ccaagaaatc cacagcggcg 
gcgcccaacg aacccgtatc gacccgcgag 
gctatatttc aacagagcag gggccaagaa 
cgatccctca cccgcagctg cctgtatcac 
gaagacgcgg aggctctctt cagtaaatac 
gccctttctc aaatttaagc gcgaaaacta 
gcacctgtcg tcagcgccat tatgagcaag 
cagccacaaa tgggacttgc ggctggagct 
atgagcgcgg gaccccacat gatatcccgg 
attctcctgg aacaggcggc tattaccacc 
tggcccgctg ccctggtgta ccaggaaagt 
gacgcccagg ccgaagttca gatgactaac 
cacagggtgc ggtcgcccgg gcagggtata 
cagctcaacg acgagtcggt gagctcctcg 
atcggcggcg ccggccgctc ttcattcacg 
tcgtcctctg agccgcgctc tggaggcatt 
ccatcggtct actttaaccc cttctcggga 
cctaactttg acgcggtaaa ggactcggcg 
gcagagcaac tgcgcctgaa acacctggtc 
gactccggtg agttttgcta ctttgaattg 
ggcgtccggc ttaccgccca gggagagctt 
cgccccctgc tagttgagcg ggacagggga 
cctaaccttg gattacatca agatctttgt 
agaaattaaa atatactggg gctcctatcg 
cccaagcaaa ccaaggcgaa ccttacctgg 
caacagtttc aacccagacg gagtgagtct 
catcagaaaa aacaccaccc tccttacctg 
tgcaccacac ctaccgcctg accgtaaacc 
tgtttaccag aacaggaggt gagcttagaa 
ctactgtggg gtttatgaac aattcaagca 
ctagataact tcgtataatg tatgctatac 



aggtgaagat cctttttgat aatctcatga 444 0 
actgagcgtc agaccccgta gaaaagatca 4500 
gcgtaatctg ctgcttgcaa acaaaaaaac 4560 
atcaagagct accaactctt tttccgaagg 4620 
atactgtcct tctagtgtag ccgtagttag 4680 
ctacatacct cgctctgcta atcctgttac 4740 
gtcttaccgg gttggactca agacgatagt 4 80 0 
cggggggttc gtgcacacag cccagcttgg 4 86 0 
tacagcgtga gctatgagaa agcgccacgc 492 0 
cggtaagcgg cagggtcgga acaggagagc 4 98 0 
ggtatcttta tagtcctgtc gggtttcgcc 5040 
gctcgtcagg ggggcggagc ctatggaaaa 510 0 
tggccttttg ctggcctttt gctcacatgt 5160 
ataaccgtat taccgccttt gagtgagctg 522 0 
gcagcgagtc agtgagcgag gaagcggaag 52 8 0 
cgcgttggcc gattcattaa tgcagctggc 5340 
gtgagcgcaa cgcaattaat gtgagttagc 540 0 
ttatgcttcc ggctcgtatg ttgtgtggaa 5460 
acagctatga ccatgattac gccaagctcg 552 0 
tggagctcca ccgcgggccc ttgcttccca 5580 
cgccgccacc cacggacgag gaggaatact 5 64 0 
aggaggagga ggacatgatg gaagactggg 570 0 
aagaggtgtc agacgaaaca ccgtcaccct 5760 
aatcggcaac cggttccagc atggctacaa 582 0 
ccgttcgccg acccaaccgt agatgggaca 5880 
agccgccgcc gttagcccaa gagcaacaac 5940 
acaagaacgc catagttgct tgcttgcaag 60 0 0 
gctttcttct ctaccatcac ggcgtggcct 6060 
atctctacag ccca.ta.ctgc accggcggca 612 0 
cagaagcaaa ggcgaccgga tagcaagact 618 0 
gcagcagcag gaggaggagc gctgcgtctg 6240 
cttagaaaca ggatttttcc cactctgtat 6300 
caagagctga aaataaaaaa caggtctctg 6360 
aaaagcgaag atcagcttcg gcgcacgctg 642 0 
tgcgcgctga ctcttaagga ctagtttcgc 6480 
cgtcatctcc agcggccaca cccggcgcca 654 0 
gaaattccca cgccctacat gtggagttac 660 0 
gcccaagact actcaacccg aataaactac 6660 
gtcaacggaa tccgcgccca ccgaaaccga 6720 
acacctcgta ataaccttaa tccccgtagt 6780 
cccgctccca ccactgtggt acttcccaga 6840 
tcaggggcgc agcttgcggg cggctttcgt 6900 
actcacctga caatcagagg gcgaggtatt 6960 
cttggtctcc gtccggacgg gacatttcag 7020 
cctcgtcagg caatcctaac tctgcagacc 70 80 
ggaactctgc aatttattga ggagtttgtg 7140 
cctcccggcc actatccgga tcaatttatt 72 00 
gacggctacg actgaatgtt aagtggagag 72 60 
cactgtcgcc gccacaagtg ctttgcccgc 7320 
cccgaggatc atatcgaggg cccggcgcac 73 8 0 
gcccgtagcc tgattcggga gtttacccag 7440 
ccctgtgttc tcactgtgat ttgcaactgt 7500 
tgccatctct gtgctgagta taataaatac 7560 
ccatcctgta aacgccaccg tcttcacccg 7620 
tacttttaac atctctccct ctgtgattta 7680 
acgagagaac ctctccgagc tcagctactc 7740 
ccgggaacgt acgagtgcgt caccggccgc 78 0 0 
agactttttc cggacagacc tcaataactc 7860 
aacccttagg gtattaggcc aaaggcgcag 792 0 
actctacggg ctattctaat tcaggtttct 7980 
gaagttatgc tagaaatgga cggaattatt 804 0 
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acagagcagc gcctgctaga aagacgcagg gcagcggccg agcaacagcg catgaatcaa 810 0 
gagctccaag acatggttaa cttgcaccag tgcaaaaggg gtatcttttg tctggtaaag 816 0 
caggccaaag tcacctacga cagtaatacc accggacacc gccttagcta caagttgcca 8220 
accaagcgtc agaaattggt ggtcatggtg ggagaaaagc ccattaccat aactcagcac 82 8 0 
tcggtagaaa ccgaaggctg cattcactca ccttgtcaag gacctgagga tctctgcacc 8340 
cttattaaga ccctgtgcgg tctcaaagat cttattccct ttaactaata aaaaaaaata 84 0 0 
ataaagcatc acttacttaa aatcagttag caaatttctg tccagtttat tcagcagcac 8460 
ctccttgccc tcctcccagc tctggtattg cagcttcctc ctggctgcaa actttctcca 8520 
caatctaaat ggaatgtcag tttcctcctg ttcctgtcca tccgcaccca ctatcttcat 8580 
gttgttgcag atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata 864 0 
tgacacggaa accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa 8700 
tgggtttcaa gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac 876 0 
ctccaatggc atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa 882 0 
ccttacctcc caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat 8880 
aaacctggaa atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc 8940 
cgcacctcta atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt 900 0 
gcacgactcc aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct 9060 
agccctgcaa acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc 912 0 
ctcaccccct ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta 9180 
tacacaaaat ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct 9240 
aaacactttg accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac 93 0 0 
taaagttact ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg 93 60 
aggactaagg attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga 942 0 
tgctcaaaac caactaaatc taagactagg acagggccct ctttttataa actcagccca 9480 
caacttggat attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa 9540 
aaagcttgag gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc 9600 
cattaatgca ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct 9660 
caaaacaaaa attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact 972 0 
aggaactggc cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga 9780 
taagctaact ttgtggacca caccagctcc atctcctaac tgtagactaa atgcagagaa 984 0 
agatgctaaa ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc 990 0 
agttttggct gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct 9960 
tattataaga tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata 10 02 0 
ttggaacttt agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt 10 0 80 
tatgcctaac ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt 10140 
cagtcaagtt tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa 10200 
cggtacacag gaaacaggag acacaactcc aagtgcatac tctatgtcat tttcatggga 102 60 
ctggtctggc cacaactaca ttaatgaaat atttgccaca tcctcttaca ctttttcata 10320 
cattgcccaa gaataaagaa tcgtttgtgt tatgtttcaa cgtgtttatt tttcaattgc 10380 
agaaaatttc aagtcatttt tcattcagta gtatagcccc accaccacat agcttataca 10440 
gatcaccgta ccttaatcaa actcacagaa ccctagtatt caacctgcca cctccctccc 105 00 
aacacacaga gtacacagtc ctttctcccc ggctggcctt aaaaagcatc atatcatggg 10560 
taacagacat attcttaggt gttatattcc acacggtttc ctgtcgagcc aaacgctcat 10620 
cagtgatatt aataaactcc ccgggcagct cacttaagtt catgtcgctg tccagctgct 10680 
gagccacagg ctgctgtcca acttgcggtt gcttaacggg cggcgaagga gaagtccacg 10740 
cctacatggg ggtagagtca taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg 10800 
cgcgaataaa ctgctgccgc cgccgctccg tcctgca 10837 



<210> 93 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Muni TOP oligo 
<400> 93 

aattgtgtta tgtttaaacg tgtttatttt tg 



32 



<210> 94 
<211> 32 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Muni BOTTOM oligo 
<400> 94 

aattcaaaaa taaacacgtt taaacataac ac 32 

<210> 95 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer F37 5 ' SphI 



<210> 96 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer F37 3 ' EcoRI 
<400> 96 

aaacacggga attcgtcttt cattc 25 

<210> 97 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer F16 5 ' SphI 



<210> 98 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer F16 3 ' Muni 
<400> 98 

tttatcaatt gtgttgtcag tcatcttc 28 

<210> 99 
<211> 25 
<212> DNA 

<213> Artificial Sequence 



<400> 95 

taccaatggc atgctatccc tcaagg 



26 



<400> 97 

gccagcggca tgctccaact taaa 



24 



<220> 

<223> Forward Primer (TPL exon 3) 
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<400> 99 

ctcaacaatt gtggatccgt actcc 25 

<210> 100 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse Primer (TPL exon 3) 
<400> 100 

gtgctcagca gatcttgcga ctgtg 25 

<210> 101 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Forward Primer (TPL exons 1/2) 
<400> 101 

ggcgcgttcg gatccactct cttcc 25 

<210> 102 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Reverse Primer (TPL exons 1/2) 
<400> 102 

ctacatgcta ggcagatctc gttcggag 2 8 

<210> 103 
<211> 1240 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> TPL sequence with restriction sites 
<400> 103 

ggatccactc tcttccgcat cgctgtctgc gagggccagc tgttggggtg agtactccct 60 
ctgaaaagcg ggcatgactt ctgcgctaag attgtcagtt tccaaaaacg aggaggattt 120 
gatattcacc tggcccgcgg tgatgccttt gagggtggcc gcatccatct ggtcagaaaa 18 0 
gacaatcttt ttgttgtcaa gcttggtggc aaacgacccg tagagggcgt tggacagcaa 240 
cttggcgatg gagcgcaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat 3 00 
gtttagctgc acgtattcgc gcgcaacgca ccgccattcg ggaaagacgg tggtgcgctc 360 
gtcgggcacc aggtgcacgc gccaaccgcg gttgtgcagg gtgacaaggt caacgctggt 42 0 
ggctacctct ccgcgtaggc gctcgttggt ccagcagagg cggccgccct tgcgcgagca 480 
gaatggcggt agggggtcta gctgcgtctc gtccgggggg tctgcgtcca cggtaaagac 54 0 
cccgggcagc aggcgcgcgt cgaagtagtc tatcttgcat ccttgcaagt ctagcgcctg 60 0 
ctgccatgcg cgggcggcaa gcgcgcgctc gtatgggttg agtgggggac cccatggcat 660 
ggggtgggtg agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct 72 0 
gagtattcca agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc 780 
gtatagttcg tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc 84 0 
tgctcggaag actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg 9 00 
gaagacgttg aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga 960 
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gtcgcgcagc ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag 1020 
ggtttccttg atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag 10 80 
gacaaactct tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg 1140 
agatccgtac tccgccgccg agggacctga gcgagtccgc atcgaccgga tcggaaaacc 12 0 0 
tctcgagaaa ggcgtctaac cagtcacagt cgcaagatct " ' 1240 

<210> 104 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pBHGlO forward primer 
<400> 104 

tgtacaccgg atccggcgca cacc 24 

<210> 105 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pBHGlO reverse primer 
<400> 105 

cacaacgagc tcaattaatt aattgccaca tcctc 35 

<210> 106 
<211> 32480 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5.iSgal.AF 



<400> 106 

catcatcaat 

ttgtgacgtg 

gatgttgcaa 

gtgtgcgccg 

taaatttggg 

agtgaaatct 

aagcttgaat 

tatgcttccg 

cagctatgac 

aggaacttgg 

agggtgtgga 

ttagtcagca 

catgcatctc 

aactccgccc 

agaggccgag 

aggcctaggc 

tcgaacactt 

atcgtcacct 

tctgaacaat 

gaccagaaac 

ggcgagatcg 

cttaatcgcc 

accgatcgcc 

ccggcaccag 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
tcgattaatg 
gctcgtatgt 
catgattacg 
ttaggtacct 
aagtccccag 
accaggtgtg 
aattagtcag 
agttccgccc 
gccgcctcgg 
ttttgcaaaa 
tttaagccgt 
gggacatgtt 
ggaaaggcat 
agcacctcga 
atcccgtcgt 
ttgcagcaca 
cttcccaaca 
aagcggtgcc 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 
gtgttactca 
tgagttagct 
tgtgtggaat 
aattcggcgc 
tctgaggcgg 
gctccccagc 
gaaagtcccc 
caaccatagt 
attctccgcc 
cctctgagct 
agcttgggat 
agataaacag 
gcagatccat 
tattgccgta 
actgagccgc 
tttacaacgt 
tccccctttc 
gttgcgcagc 
ggaaagctgg 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
cactcattag 
tgtgagcgga 
agcaccatgg 
aaagaaccag 
aggcagaagt 
aggctcccca 
cccgccccta 
ccatggctga 
attccagaag 
ctctataatc 
gctgggacac 
gcacgtaaac 
agccgtggcg 
gatattgccc 
cgtgactggg 
gccagctggc 
ctgaatggcg 
ctggagtgcg 



tgataatgag 
tagtagtgtg 
tggcaaaagt 
gttttaggcg 
gggaaaactg 
tctctagcat 
gcaccccagg 
taacaatttc 
cctgaaataa 
ctgtggaatg 
atgcaaagca 
gcaggcagaa 
actccgccca 
ctaatttttt 
tagtgaggag 
tcgcgcaacc 
ttcacatgag 
tcgcaagccg 
gtctggtacc 
agcgtttcaa 
aaaaccctgg 
gtaatagcga 
aatggcgctt 
atcttcctga 



ggggtggagt 
gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 
cgatgtcgac 
ctttacactt 
acacaggaaa 
cctctgaaag 
tgtgtcagtt 
tgcatctcaa 
gtatgcaaag 
tcccgcccct 
ttatttatgc 
gcttttttgg 
tattttcccc 
cgaaaaatac 
actgatgcct 
ggtgggtgaa 
cgcgctgtat 
cgttacccaa 
agaggcccgc 
tgcctggttt 
ggccgatact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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gtcgtcgtcc 
acctatccca 
tcgctcacat 
gatggcgtta 
gacagtcgtt 
ctcgcggtga 
cggatgagcg 
gafcttccatg 
gttcagatgt 
gaaacgcagg 
ggttatgccg 
gaaatcccga 
gaagcagaag 
ctgaacggca 
ggtcaggtca 
tttaacgccg 
cgctacggcc 
aatcgtctga 
gtgcagcgcg 
cacggcgcta 
gtgcagtatg 
gcgcgcgtgg 
ctttcgctac 
aacagtcttg 
ggcggcttcg 
ccgtggtcgg 
aacggtctgg 
cagcagtttt 
ttccgtcata 
gcaagcggfcg 
gaactaccgc 
aacgcgaccg 
gaaaacctca 
gaaatggatt 
tttctttcac 
ttcacccgtg 
aacgcctggg 
cagtgcacgg 
catcagggga 
atggcgatta 
ctgaactgcc 
gaaaactatc 
gacatgtata 
ttgaattatg 
caacagcaac 
ctgaatatcg 
tcggcggaat 
taataataac 
tatttaaaaa 
atgggagcct 
agtgatacgg 
tttggtctgc 
aaagcaatag 
gtttgtccaa 
agggtgggaa 
gccgccgccg 
acgcgcatgc 
cgccccgtcc 
ttggagactg 
actgactttg 
gatgacaagt 



cctcaaactg 
ttacggtcaa 
ttaatgttga 
actcggcgtt 
tgccgtctga 
tggtgctgcg 
gcattttccg 
ttgccactcg 
gcggcgagtt 
tcgccagcgg 
atcgcgtcac 
atctctatcg 
cctgcgatgt 
agccgttgct 
tggatgagca 
tgcgctgttc 
tgtatgtggt 
ccgatgatcc 
atcgtaatca 
atcacgacgc 
aaggcggcgg 
atgaagacca 
ctggagagac 
gcggtttcgc 
tctgggactg 
cttacggcgg 
tctttgccga 
tccagttccg 
gcgataacga 
aagtgcctct 
agccggagag 
catggtcaga 
gtgtgacgct 
tttgcatcga 
agatgtggat 
caccgctgga 
tcgaacgctg 
cagatacact 
aaaccttatt 
ccgttgatgt 
agctggcgca 
ccgaccgcct 
ccccgtacgt 
gcccacacca 
tgatggaaac 
acggtttcca 
tccagctgag 
cgggcaggcc 
acacaaactt 
acttcccgtt 
gtattatttt 
tttctgacaa 
catcacaaat 
actcatcaat 
agaatatata 
ccatgagcac 
ccccatgggc 
tgcccgcaaa 
cagcctccgc 
ctttcctgag 
tgacggctct 



gcagatgcac 
tccgccgttt 
tgaaagctgg 
tcatctgtgg 
atttgacctg 
ttggagtgac 
tgacgtctcg 
ctttaatgat 
gcgtgactac 
caccgcgcct 
actacgtctg 
tgcggtggtt 
cggtttccgc 
gattcgaggc 
gacgatggtg 
gcattatccg 
ggatgaagcc 
gcgctggcta 
cccgagtgtg 
gctgtatcgc 
agccgacacc 
gcccttcccg 
gcgcccgctg 
taaatactgg 
ggtggatcag 
tgattttggc 
ccgcacgccg 
tttatccggg 
gctcctgcac 
ggatgtcgct 
cgccgggcaa 
agccgggcac 
ccccgccgcg 
gctgggtaat 
tggcgataaa 
taacgacatt 
gaaggcggcg 
tgctgatgcg 
tatcagccgg 
tgaagtggcg 
ggtagcagag 
tactgccgcc 
cttcccgagc 
gtggcgcggc 
cagccatcgc 
tatggggatt 
cgccggtcgc 
atgtctgccc 
ttggatgttc 
tttcccgatt 
tgccgctatt 
actcggaact 
ttcacaaata 
gtatcttatc 
aggtgggggt 
caactcgttt 
cggggtgcgt 
ctctactacc 
cgccgcttca 
cccgcttgca 
tttggcacaa 



ggttacgatg 
gttcccacgg 
ctacaggaag 
tgcaacgggc 
agcgcatttt 
ggcagttatc 
ttgctgcata 
gatttcagcc 
ctacgggtaa 
ttcggcggtg 
aacgtcgaaa 
gaactgcaca 
gaggtgcgga 
gttaaccgtc 
caggatatcc 
aaccatccgc 
aatattgaaa 
ccggcgatga 
atcatctggt 
tggatcaaat 
acggccaccg 
gctgtgccga 
atcctttgcg 
caggcgtttc 
tcgctgatta 
gatacgccga 
catccagcgc 
caaaccatcg 
tggatggtgg 
ccacaaggta 
ctctggctca 
atcagcgcct 
tcccacgcca 
aagcgttggc 
aaacaactgc 
ggcgtaagtg 
ggccattacc 
gtgctgatta 
aaaacctacc 
agcgatacac 
cgggtaaact 
tgttttgacc 
gaaaacggtc 
gacttccagt 
catctgctgc 
ggtggcgacg 
taccattacc 
gtatttcgcg 
ggtttattct 
tggctacatg 
tctctgttct 
tgtttattgc 
aagcattttt 
atgtctggat 
ctfcatgtagt 
gatggaagca 
cagaatgtga 
ttgacctacg 
gccgctgcag 
agcagtgcag 
ttggattctt 



cgcccatcta 
agaatccgac 
gccagacgcg 
gctgggtcgg 
tacgcgccgg 
tggaagatca 
aaccgactac 
gcgctgtact 
cagtttcttt 
aaattatcga 
acccgaaacfc 
ccgccgacgg 
ttgaaaatgg 
acgagcatca 
tgctgatgaa 
tgtggtacac 
cccacggcat 
gcgaacgcgt 
cgctggggaa 
ctgtcgatcc 
atattatttg 
aatggtccat 
aatacgccca 
gtcagtatcc 
aatatgatga 
acgatcgcca 
tgacggaagc 
aagtgaccag 
cgctggatgg 
aacagttgat 
cagtacgcgt 
ggcagcagtg 
tcccgcatct 
aatttaaccg 
tgacgccgct 
aagcgacccg 
aggccgaagc 
cgaccgctca 
ggattgatgg 
cgcatccggc 
ggctcggatt 
gctgggatct 
tgcgctgcgg 
tcaacatcag 
acgcggaaga 
actcctggag 
agttggtctg 
taaggaaatc 
ttttctttta 
acatcaacca 
cgctattatt 
agcttataat 
ttcactgcat 
ccagatctgg 
tttgtatctg 
ttgtgagctc 
tgggctccag 
agaccgtgtc 
ccaccgcccg 
cttcccgttc 
tgacccggga 



caccaacgta 
gggttgttac 
aattattttt 
ttacggccag 
agaaaaccgc 
ggatatgtgg 
acaaatcagc 
ggaggctgaa 
atggcagggt 
tgagcgtggt 
gtggagcgcc 
cacgctgatt 
tctgctgctg 
tcctctgcat 
gcagaacaac 
gctgtgcgac 
ggtgccaatg 
aacgcgaatg 
tgaatcaggc 
ttcccgcccg 
cccgatgtac 
caaaaaatgg 
cgcgatgggt 
ccgtttacag 
aaacggcaac 
gttctgtatg 
aaaacaccag 
cgaatacctg 
taagccgctg 
tgaactgcct 
agtgcaaccg 
gcgtctggcg 
gaccaccagc 
ccagtcaggc 
gcgcgatcag 
cattgaccct 
agcgttgttg 
cgcgtggcag 
tagtggtcaa 
gcggattggc 
agggccgcaa 
gccattgtca 
gacgcgcgaa 
ccgctacagt 
aggcacatgg 
cccgtcagta 
gtgtcaaaaa 
cattatgtac 
cttttttatc 
tatcagcaaa 
ccaaccgctg 
ggttacaaat 
tctagttgtg 
gcgtggctta 
ttttgcagca 
atatttgaca 
cattgatggt 
tggaacgccg 
cgggattgtg 
atccgcccgc 
acttaatgtc 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
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gtttctcagc agctgttgga tctgcgccag 
cccaatgcgg tttaaaacat aaataaaaaa 
gtgtcttgct gtctttattt aggggttttg 
cggtcgttga gggtcctgtg tattttttcc 
agatacatgg gcataagccc gtctctgggg 
tgcggggtgg tgttgtagat gatccagtcg 
atgtctttca gtagcaagct gattgccagg 
cggttaagct gggatgggtg" catacgtggg 
aggttggcta tgttcccagc catatccctc 
acagtgtatc cggtgcactt gggaaatttg 
aacttggaga cgcccttgtg acctccaaga 
atgggcccac gggcggcggc ctgggcgaag 
tgttccagga tgagatcgtc ataggccatt 
tgcggtataa tggttccatc cggcccaggg 
cacgctttga gttcagatgg ggggatcatg 
tccggggtag gggagatcag ctgggaagaa 
cagccggtgg gcccgtaaat cacacctatt 
cagctgccgt catccctgag caggggggcc 
ttttccctga ccaaatccgc cagaaggcgc 
gaagcaaagt ttttcaacgg tttgagaccg 
ccaagcagtt ccaggcggtc ccacagctcg 
atatctcctc gtttcgcggg ttggggcggc 
ccagacgggc cagggtcatg tctttccacg 
tcacggtgaa ggggtgcgct ccgggctgcg 
tgctggtgct gaagcgctgc cggtcttcgc 
tggtgtcata gtccagcccc tccgcggcgt 
aggcgccgca cgaggggcag tgcagacttt 
ccgattccgg ggagtaggca tccgcgccgc 
gccaggtgag ctctggccgt tcggggtcaa 
gtttcttacc tctggtttcc atgagccggt 
tgtccccgta tacagacttg agaggcctgt 
atagaaactc ggaccactct gagacaaagg 
agtgggaggg gtagcggtcg ttgtccacta 
acatgtcgcc ctcttcggca tcaaggaagg 
cgggtgttcc tgaagggggg ctataaaagg 
ccgcatcgct gtctgcgagg gccagctgtt 
tgacttctgc gctaagattg tcagtttcca 
ccgcggtgat gcctttgagg gtggccgcat 
tgtcaagctt ggtggcaaac gacccgtaga 
gcagggtttg gtttttgtcg cgatcggcgc 
attcgcgcgc aacgcaccgc cattcgggaa 
gcacgcgcca accgcggttg tgcagggtga 
gtaggcgctc gttggtccag cagaggcggc 
ggtctagctg cgtctcgtcc ggggggtctg 
gcgcgtcgaa gtagtctatc ttgcatcctt 
cggcaagcgc gcgctcgtat gggttgagtg 
cggaggcgta catgccgcaa atgtcgtaaa 
atgtagggta gcatcttcca ccgcggatgc 
agggagcgag gaggtcggga ccgaggttgc 
tctgcctgaa gatggcatgt gagttggatg 
tggcgtctgt gagacctacc gcgtcacgca 
tgaccagctc ggcggtgacc tgcacgtcta 
tgtcatactt atcctgtccc ttttttttcc 
ggtctttcca gtactcttgg atcggaaacc 
tgtagaactg gttgacggcc tggtaggcgc 
cctgcgcggc cttccggagc gaggtgtggg 
tgaggtactg gtatttgaag tcagtgtcgt 
ccgtgcgctt tttggaacgc ggatttggca 
ttcccgcgcg aggcataaag ttgcgtgtga 
tgttaattac ctgggcggcg agcacgatct 
tgtaaagttc caagaagcgc gggatgccct 



caggtttctg ccctgaaggc ttcctcccct 5160 
ccagactctg tttggatttg gatcaagcaa 522 0 
cgcgcgcggt aggcccggga ccagcggtct 52 8 0 
aggacgtggt aaaggtgact ctggatgttc 5340 
tggaggtagc accactgcag agcttcatgc 54 0 0 
tagcaggagc gctgggcgtg gtgcctaaaa 54 6 0 
ggcaggccct tggtgtaagt gtttacaaag 552 0 
gatatgagat gcatcttgga ctgtattttt 5580 
cggggattca tgttgtgcag aaccaccagc 5640 
tcatgtagct tagaaggaaa tgcgtggaag 570 0 
ttttccatgc attcgtccat aatgatggca 5760 
atatttctgg gatcactaac gtcatagttg 582 0 
tttacaaagc gcgggcggag ggtgccagac 5 88 0 
gcgtagttac cctcacagat ttgcatttcc 5 94 0 
tctacctgcg gggcgatgaa gaaaacggtt 600 0 
agcaggttcc tgagcagctg cgacttaccg 6 06 0 
accgggtgca actggtagtt aagagagctg 612 0 
acttcgttaa gcatgtccct gactcgcatg 6180 
tcgccgccca gcgatagcag ttcttgcaag 624 0 
tccgccgtag gcatgctttt gagcgtttga 63 0 0 
gtcacctgct ctacggcatc tcgatccagc 6360 
tttcgctgta cggcagtagt cggtgctcgt 642 0 
ggcgcagggt cctcgtcagc gtagtctggg 64 80 
cgctggccag ggtgcgcttg aggctggtcc 654 0 
cctgcgcgtc ggccaggtag catttgacca 660 0 
ggcccttggc gcgcagcttg cccttggagg 6 66 0 
tgagggcgta gagcttgggc gcgagaaata 672 0 
aggccccgca gacggtctcg cattccacga 6780 
aaaccaggtt tcccccatgc tttttgatgc 6840 
gtccacgctc ggtgacgaaa aggctgtccg 690 0 
cctcgagcgg tgttccgcgg tcctcctcgt 6960 
ctcgcgtcca ggccagcacg aaggaggcta 7 02 0 
gggggtccac tcgctccagg gtgtgaagac 708 0 
tgattggttt gtaggtgtag gccacgtgac 714 0 
gggtgggggc gcgttcgtcc tcactctctt 72 0 0 
ggggtgagta ctccctctga aaagcgggca 7260 
aaaacgagga ggatttgata ttcacctggc 732 0 
ccatctggtc agaaaagaca atctttttgt 7380 
gggcgttgga cagcaacttg gcgatggagc 744 0 
gctccttggc cgcgatgttt agctgcacgt 750 0 
agacggtggt gcgctcgtcg ggcaccaggt 7560 
caaggtcaac gctggtggct acctctccgc 7 62 0 
cgcccttgcg cgagcagaat ggcggtaggg 768 0 
cgtccacggt aaagaccccg ggcagcaggc 774 0 
gcaagtctag cgcctgctgc catgcgcggg 7 80 0 
ggggacccca tggcatgggg tgggtgagcg 7 86 0 
cgtagagggg ctctctgagt attccaagat 792 0 
tggcgcgcac gtaatcgtat agttcgtgcg 798 0 
tacgggcggg ctgctctgct cggaagacta 8 04 0 
atatggttgg acgctggaag acgttgaagc 810 0 
cgaaggaggc gtaggagtcg cgcagcttgt 816 0 
gggcgcagta gtccagggtt tccttgatga 822 0 
acagctcgcg gttgaggaca aactcttcgc 82 80 
cgtcggcctc cgaacggtaa gagcctagca 834 0 
agcatccctt ttctacgggt agcgcgtatg 840 0 
tgagcgcaaa ggtgtccctg accatgactt 8460 
cgcatccgcc ctgctcccag agcaaaaagt 852 0 
gggcgaaggt gacatcgttg aagagtatct 858 0 
tgcggaaggg tcccggcacc tcggaacggt 864 0 
cgtcaaagcc gttgatgttg tggcccacaa 8 70 0 
tgatggaagg caatttttta agttcctcgt 8760 
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aggtgagctc ttcaggggag ctgagcccgt gctctgaaag ggcccagtct gcaagatgag 8820 
ggttggaagc gacgaatgag ctccacaggt cacgggccat tagcatttgc aggtggtcgc 8 880 
gaaaggtcct aaactggcga cctatggcca ttttttctgg ggtgatgcag tagaaggtaa 8 94 0 
gcgggtcttg ttcccagcgg tcccatccaa ggttcgcggc taggtctcgc gcggcagtca 90 00 
ctagaggctc atctccgccg aacttcatga ccagcatgaa gggcacgagc tgcttcccaa 9060 
aggcccccat ccaagtatag gtctctacat cgtaggtgac aaagagacgc tcggtgcgag 912 0 
gatgcgagcc gatcgggaag aactggatct cccgccacca attggaggag tggctattga 9180 
tgtggtgaaa gtagaagtcc ctgcgacggg ccgaacacfcc gtgctggctt ttgtaaaaac 9240 
gtgcgcagta ctggcagcgg tgcacgggct gtacatcctg cacgaggttg acctgacgac 93 00 
cgcgcacaag gaagcagagt gggaatttga gcccctcgcc tggcgggttt ggctggtggt 93 60 
cttctacttc ggctgcttgt ccttgaccgt ctggctgctc gaggggagtt acggtggatc 9420 
ggaccaccac gccgcgcgag cccaaagtcc agatgtccgc gcgcggcggt cggagcttga 94 80 
tgacaacatc gcgcagatgg gagctgtcca tggtctggag ctcccgcggc gtcaggtcag 9540 
gcgggagctc ctgcaggttt acctcgcata gacgggtcag ggcgcgggct agatccaggt 960 0 
gatacctaat ttccaggggc tggttggtgg cggcgtcgat ggcttgcaag aggccgcatc 9660 
cccgcggcgc gactacggta ccgcgcggcg ggcggtgggc cgcgggggtg tccttggatg 972 0 
atgcatctaa aagcggtgac gcgggcgagc ccccggaggt agggggggct ccggacccgc 97 80 
cgggagaggg ggcaggggca cgtcggcgcc gcgcgcgggc aggagctggt gctgcgcgcg 9840 
taggttgctg gcgaacgcga cgacgcggcg gttgatctcc tgaatctggc gcctctgcgt 99 0 0 
gaagacgacg ggcccggtga gcttgagcct gaaagagagt tcgacagaat caatttcggt 9960 
gtcgttgacg gcggcctggc gcaaaatctc ctgcacgtct cctgagttgt cttgataggc 10 02 0 
gatctcggcc atgaactgct cgatctcttc ctcctggaga tctccgcgtc cggctcgctc 10 08 0 
cacggtggcg gcgaggtcgt tggaaatgcg ggccatgagc tgcgagaagg cgttgaggcc 10140 
tccctcgttc cagacgcggc tgtagaccac gcccccttcg gcatcgcggg cgcgcatgac 102 0 0 
cacctgcgcg agattgagct ccacgtgccg ggcgaagacg gcgtagtttc gcaggcgctg 10260 
aaagaggtag ttgagggtgg tggcggtgtg ttctgccacg aagaagtaca taacccagcg 10320 
tcgcaacgtg gattcgttga tatcccccaa ggcctcaagg cgctccatgg cctcgtagaa 10380 
gtccacggcg aagttgaaaa actgggagtt gcgcgccgac acggttaact cctcctccag 10440 
aagacggatg agctcggcga cagtgtcgcg cacctcgcgc tcaaaggcta caggggcctc 10500 
ttcttcttct tcaatctcct cttccataag ggcctcccct tcttcttctt ctggcggcgg 10560 
tgggggaggg gggacacggc ggcgacgacg gcgcaccggg aggcggtcga caaagcgctc 10 62 0 
gatcatctcc ccgcggcgac ggcgcatggt ctcggtgacg gcgcggccgt tctcgcgggg 10 68 0 
gcgcagttgg aagacgccgc ccgtcatgtc ccggttatgg gttggcgggg ggctgccatg 10740 
cggcagggat acggcgctaa cgatgcatct caacaattgt tgtgtaggta ctccgccgcc 10 800 
gagggacctg agcgagtccg catcgaccgg atcggaaaac ctctcgagaa aggcgtctaa 10 86 0 
ccagtcacag tcgcaaggta ggctgagcac cgtggcgggc ggcagcgggc ggcggtcggg 10920 
gttgtttctg gcggaggtgc tgctgatgat gtaattaaag taggcggtct tgagacggcg 10980 
gatggtcgac agaagcacca tgtccttggg tccggcctgc tgaatgcgca ggcggtcggc 11040 
catgccccag gcttcgtttt gacatcggcg caggtctttg tagtagtctt gcatgagcct 1110 0 
ttctaccggc acttcttctt ctccttcctc ttgtcctgca tctcttgcat ctatcgctgc 11160 
ggcggcggcg gagtttggcc gtaggtggcg ccctcttcct cccatgcgtg tgaccccgaa 11220 
gcccctcatc ggctgaagca gggctaggtc ggcgacaacg cgctcggcta atatggcctg 112 80 
ctgcacctgc gtgagggtag actggaagtc atccatgtcc acaaagcggt ggtatgcgcc 113 40 
cgtgttgatg gtgtaagtgc agttggccat aacggaccag ttaacggtct ggtgacccgg 11400 
ctgcgagagc tcggtgtacc tgagacgcga gtaagccctc gagtcaaata cgtagtcgtt 11460 
gcaagtccgc accaggtact ggtatcccac caaaaagtgc ggcggcggct ggcggtagag 11520 
gggccagcgt agggtggccg gggctccggg ggcgagatct tccaacataa ggcgatgata 115 8 0 
tccgtagatg tacctggaca tccaggtgat gccggcggcg gtggtggagg cgcgcggaaa 11640 
gtcgcggacg cggttccaga tgttgcgcag cggcaaaaag tgctccatgg tcgggacgct 117 0 0 
ctggccggtc aggcgcgcgc aatcgttgac gctctagacc gtgcaaaagg agagcctgta 11760 
agcgggcact cttccgtggt ctggtggata aattcgcaag ggtatcatgg cggacgaccg 1182 0 
gggttcgagc cccgtatccg gccgtccgcc gtgatccatg cggttaccgc ccgcgtgtcg 118 80 
aacccaggtg tgcgacgtca gacaacgggg gagtgctcct tttggcttcc ttccaggcgc 11940 
ggcggctgct gcgctagctt ttttggccac tggccgcgcg cagcgtaagc ggttaggctg 12000 
gaaagcgaaa gcattaagtg gctcgctccc tgtagccgga gggttatttt ccaagggttg 12 060 
agtcgcggga cccccggttc gagtctcgga ccggccggac tgcggcgaac gggggtttgc 12120 
ctccccgtca tgcaagaccc cgcttgcaaa ttcctccgga aacagggacg agcccctttt 12180 
ttgcttttcc cagatgcatc cggtgctgcg gcagatgcgc ccccctcctc agcagcggca 12240 
agagcaagag cagcggcaga catgcagggc accctcccct cctcctaccg cgtcaggagg 123 00 
ggcgacatcc gcggttgacg cggcagcaga tggtgattac gaacccccgc ggcgccgggc 123 60 
ccggcactac ctggacttgg aggagggcga gggcctggcg cggctaggag cgccctctcc 1242 0 
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tgagcggtac ccaagggtgc agctgaagcg tgatacgcgt gaggcgtacg tgccgcggca 12480 
gaacctgttt cgcgaccgcg agggagagga gcccgaggag atgcgggatc gaaagttcca 1254 0 
cgcagggcgc gagctgcggc atggcctgaa tcgcgagcgg ttgctgcgcg aggaggactt 12 60 0 
tgagcccgac gcgcgaaccg ggattagtcc cgcgcgcgca cacgtggcgg ccgccgacct 12660 
ggtaaccgca tacgagcaga cggtgaacca ggagattaac tttcaaaaaa gctttaacaa 12 72 0 
ccacgtgcgt acgcttgtgg cgcgcgagga ggtggctata ggactgatgc atctgtggga 12 78 0 
ctttgtaagc gcgctggagc aaaacccaaa tagcaagccg ctcatggcgc agctgttcct 12 84 0 
tatagtgcag cacagcaggg acaacgaggc attcagggat gcgctgctaa acatagtaga 12 90 0 
gcccgagggc cgctggctgc tcgatttgat aaacatcctg cagagcatag tggtgcagga 12 96 0 
gcgcagcttg agcctggctg acaaggtggc cgccatcaac tattccatgc ttagcctggg 13 02 0 
caagttttac gcccgcaaga tataccatac cccttacgtt cccatagaca aggaggtaaa 13080 
gatcgagggg ttctacatgc gcatggcgct gaaggtgcfct accttgagcg acgacctggg 1314 0 
cgtttatcgc aacgagcgca tccacaaggc cgtgagcgtg agccggcggc gcgagctcag 132 0 0 
cgaccgcgag ctgatgcaca gcctgcaaag ggccctggct ggcacgggca gcggcgatag 13 2 60 
agaggccgag tcctactttg acgcgggcgc tgacctgcgc tgggccccaa gccgacgcgc 13 32 0 
cctggaggca gctggggccg gacctgggct ggcggtggca cccgcgcgcg ctggcaacgt 133 80 
cggcggcgtg gaggaatatg acgaggacga tgagtacgag ccagaggacg gcgagtacta 1344 0 
agcggtgatg tttctgatca gatgatgcaa gacgcaacgg acccggcggt gcgggcggcg 1350 0 
ctgcagagcc agccgtccgg ccttaactcc acggacgact ggcgccaggt catggaccgc 13560 
atcatgtcgc tgactgcgcg caatcctgac gcgttccggc agcagccgca ggccaaccgg 13 62 0 
ctctccgcaa ttctggaagc ggtggtcccg gcgcgcgcaa accccacgca cgagaaggtg 13 68 0 
ctggcgatcg taaacgcgct ggccgaaaac agggccatcc ggcccgacga ggccggcctg 13 74 0 
gtctacgacg cgctgcttca gcgcgtggct cgttacaaca gcggcaacgt gcagaccaac 13 80 0 
ctggaccggc tggtggggga tgtgcgcgag gccgtggcgc agcgtgagcg cgcgcagcag 13 86 0 
cagggcaacc tgggctccat ggttgcacta aacgccttcc tgagtacaca gcccgccaac 13 92 0 
gtgccgcggg gacaggagga ctacaccaac tttgtgagcg cactgcggct aatggtgact 13 98 0 
gagacaccgc aaagtgaggt gtaccagtct gggccagact attttttcca gaccagtaga 14040 
caaggcctgc agaccgtaaa cctgagccag gctttcaaaa acttgcaggg gctgtggggg 14100 
gtgcgggctc ccacaggcga ccgcgcgacc gtgtctagct tgctgacgcc caactcgcgc 14160 
ctgttgctgc tgctaatagc gcccttcacg gacagtggca gcgtgtcccg ggacacatac 1422 0 
ctaggtcact tgctgacact gtaccgcgag gccataggtc aggcgcatgt ggacgagcat 14280 
actttccagg agattacaag tgtcagccgc gcgctggggc aggaggacac gggcagcctg 14340 
gaggcaaccc taaactacct gctgaccaac cggcggcaga agatcccctc gttgcacagt 144 0 0 
ttaaacagcg aggaggagcg cattttgcgc tacgtgcagc agagcgtgag ccttaacctg 14460 
atgcgcgacg gggtaacgcc cagcgtggcg ctggacatga ccgcgcgcaa catggaaccg 14 52 0 
ggcatgtatg cctcaaaccg gccgtttatc aaccgcctaa tggactactt gcatcgcgcg 14580 
gccgccgtga accccgagta tttcaccaat gccatcttga acccgcactg gctaccgccc 14640 
cctggtttct acaccggggg attcgaggtg cccgagggta acgatggatt cctctgggac 14700 
gacatagacg acagcgtgtt ttccccgcaa ccgcagaccc tgctagagtt gcaacagcgc 14760 
gagcaggcag aggcggcgct gcgaaaggaa agcttccgca ggccaagcag cttgtccgat 1482 0 
ctaggcgctg cggccccgcg gtcagatgct agtagcccat ttccaagctt gatagggtct 14 880 
cttaccagca ctcgcaccac ccgcccgcgc ctgctgggcg aggaggagta cctaaacaac 14940 
tcgctgctgc agccgcagcg cgaaaaaaac ctgcctccgg catttcccaa caacgggata 15000 
gagagcctag tggacaagat gagtagatgg aagacgtacg cgcaggagca cagggacgtg 15060 
ccaggcccgc gcccgcccac ccgtcgtcaa aggcacgacc gtcagcgggg tctggtgtgg 1512 0 
gaggacgatg actcggcaga cgacagcagc gtcctggatt tgggagggag tggcaacccg 15180 
tttgcgcacc ttcgccccag gctggggaga atgttttaaa aaaaaaaaag catgatgcaa 15240 
aataaaaaac tcaccaaggc catggcaccg agcgttggtt ttcttgtatt ccccttagta 153 0 0 
tgcggcgcgc ggcgatgtat gaggaaggtc ctcctccctc ctacgagagt gtggtgagcg 153 60 
cggcgccagt ggcggcggcg ctgggttctc ccttcgatgc tcccctggac ccgccgtttg 1542 0 
tgcctccgcg gtacctgcgg cctaccgggg ggagaaacag catccgttac tctgagttgg 154 80 
cacccctatt cgacaccacc cgtgtgtacc tggtggacaa caagtcaacg gatgtggcat 15540 
ccctgaacta ccagaacgac cacagcaact ttctgaccac ggtcattcaa aacaatgact 15600 
acagcccggg ggaggcaagc acacagacca tcaatcttga cgaccggtcg cactggggcg 15660 
gcgacctgaa aaccatcctg cataccaaca tgccaaatgt gaacgagttc atgtttacca 15720 
ataagtttaa ggcgcgggtg atggtgtcgc gcttgcctac taaggacaat caggtggagc 15780 
tgaaatacga gtgggtggag ttcacgctgc ccgagggcaa ctactccgag accatgacca 15 840 
tagaccttat gaacaacgcg atcgtggagc actacttgaa agtgggcaga cagaacgggg 15900 
ttctggaaag cgacatcggg gtaaagtttg acacccgcaa cttcagactg gggtttgacc 15960 
ccgtcactgg tcttgtcatg cctggggtat atacaaacga agccttccat ccagacatca 16020 
ttttgctgcc aggatgcggg gtggacttca cccacagccg cctgagcaac ttgttgggca 16080 
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tccgcaagcg gcaacccttc caggagggct ttaggatcac ctacgatgat ctggagggtg 16140 
gtaacattcc cgcactgttg gatgtggacg cctaccaggc gagcttgaaa gatgacaccg 16200 
aacagggcgg gggtggcgca ggcggcagca acagcagtgg cagcggcgcg gaagagaact 16260 
ccaacgcggc agccgcggca atgcagccgg tggaggacat gaacgatcat gccattcgcg 1632 0 
gcgacacctt tgccacacgg gctgaggaga agcgcgctga ggccgaagca gcggccgaag 163 8 0 
ctgccgcccc cgctgcgcaa cccgaggtcg agaagcctca gaagaaaccg gtgatcaaac 1644 0 
ccctgacaga ggacagcaag aaacgcagtt acaacctaat aagcaatgac agcaccttca 16500 
cccagtaccg cagctggtac cttgcataca actacggcga ccctcagacc ggaatccgct 1656 0 
catggaccct gctttgcact cctgacgtaa cctgcggctc ggagcaggtc tactggtcgt 16620 
tgccagacat gatgcaagac cccgtgacct tccgctccac gcgccagatc agcaactttc 16680 
cggtggtggg cgccgagctg ttgcccgtgc actccaagag cttctacaac gaccaggccg 1674 0 
tctactccca actcatccgc cagtttacct ctctgaccca cgtgttcaat cgctttcccg 16800 
agaaccagat tttggcgcgc ccgccagccc ccaccatcac caccgtcagt gaaaacgttc 16860 
ctgctctcac agatcacggg acgctaccgc tgcgcaacag catcggagga gtccagcgag 1692 0 
tgaccattac tgacgccaga cgccgcacct gcccctacgt ttacaaggcc ctgggcatag 16980 
tctcgccgcg cgtcctatcg agccgcactt tttgagcaag catgtccatc cttatatcgc 17040 
ccagcaataa cacaggctgg ggcctgcgct tcccaagcaa gatgtttggc ggggccaaga 17100 
agcgctccga ccaacaccca gtgcgcgtgc gcgggcacta ccgcgcgccc tggggcgcgc 17160 
acaaacgcgg ccgcactggg cgcaccaccg tcgatgacgc catcgacgcg gtggtggagg 1722 0 
aggcgcgcaa ctacacgccc acgccgccac cagtgtccac agtggacgcg gccattcaga 17280 
ccgtggtgcg cggagcccgg cgctatgcta aaatgaagag acggcggagg cgcgtagcac 1734 0 
gtcgccaccg ccgccgaccc ggcactgccg cccaacgcgc ggcggcggcc ctgcttaacc 17400 
gcgcacgtcg caccggccga cgggcggcca tgcgggccgc tcgaaggctg gccgcgggta 17460 
ttgtcactgt gccccccagg tccaggcgac gagcggccgc cgcagcagcc gcggccatta 1752 0 
gtgctatgac tcagggtcgc aggggcaacg tgtattgggt gcgcgactcg gttagcggcc 17580 
tgcgcgtgcc cgtgcgcacc cgccccccgc gcaactagat tgcaagaaaa aactacttag 17640 
actcgtactg ttgtatgtat ccagcggcgg cggcgcgcaa cgaagctatg tccaagcgca 17700 
aaatcaaaga agagatgctc caggtcatcg cgccggagat ctatggcccc ccgaagaagg 17760 
aagagcagga ttacaagccc cgaaagctaa agcgggtcaa aaagaaaaag aaagatgatg 17820 
atgatgaact tgacgacgag gtggaactgc tgcacgctac cgcgcccagg cgacgggtac 17 8 80 
agtggaaagg tcgacgcgta aaacgtgttt tgcgacccgg caccaccgta gtctttacgc 17 940 
ccggtgagcg ctccacccgc acctacaagc gcgtgtatga tgaggtgtac ggcgacgagg 18000 
acctgcttga gcaggccaac gagcgcctcg gggagtttgc ctacggaaag cggcataagg 18060 
acatgctggc gttgccgctg gacgagggca acccaacacc tagcctaaag cccgtaacac 1812 0 
tgcagcaggt gctgcccgcg cttgcaccgt ccgaagaaaa gcgcggccta aagcgcgagt 18180 
ctggtgactt ggcacccacc gtgcagctga tggtacccaa gcgccagcga ctggaagatg 18240 
tcttggaaaa aatgaccgtg gaacctgggc tggagcccga ggtccgcgtg cggccaatca 183 00 
agcaggtggc gccgggactg ggcgtgcaga ccgtggacgt tcagataccc actaccagta 18360 
gcaccagtat tgccaccgcc acagagggca tggagacaca aacgtccccg gttgcctcag 18420 
cggtggcgga tgccgcggtg caggcggtcg ctgcggccgc gtccaagacc tctacggagg 18480 
tgcaaacgga cccgtggatg tttcgcgttt cagccccccg gcgcccgcgc ggttcgagga 18540 
agtacggcgc cgccagcgcg ctactgcccg aatatgccct acatccttcc attgcgccta 18600 
cccccggcta tcgtggctac acctaccgcc ccagaagacg agcaactacc cgacgccgaa 18660 
ccaccactgg aacccgccgc cgccgtcgcc gtcgccagcc cgtgctggcc ccgatttccg 1872 0 
tgcgcagggt ggctcgcgaa ggaggcagga ccctggtgct gccaacagcg cgctaccacc 18780 
ccagcatcgt ttaaaagccg gtctttgtgg ttcttgcaga tatggccctc acctgccgcc 18840 
tccgtttccc ggtgccggga ttccgaggaa gaatgcaccg taggaggggc atggccggcc 18900 
acggcctgac gggcggcatg cgtcgtgcgc accaccggcg gcggcgcgcg tcgcaccgtc 18960 
gcatgcgcgg cggtatcctg cccctcctta ttccactgat cgccgcggcg attggcgccg 19020 
tgcccggaat tgcatccgtg gccttgcagg cgcagagaca ctgattaaaa acaagttgca 190 80 
tgtggaaaaa tcaaaataaa aagtctggac tctcacgctc gcttggtcct gtaactattt 19140 
tgtagaatgg aagacatcaa ctttgcgtct ctggccccgc gacacggctc gcgcccgttc 192 00 
atgggaaact ggcaagatat cggcaccagc aatatgagcg gtggcgcctt cagctggggc 192 60 
tcgctgtgga gcggcattaa aaatttcggt tccaccgtta agaactatgg cagcaaggcc 19320 
tggaacagca gcacaggcca gatgctgagg gataagttga aagagcaaaa tttccaacaa 193 80 
aaggtggtag atggcctggc ctctggcatt agcggggtgg tggacctggc caaccaggca 1944 0 
gtgcaaaata agattaacag taagcttgat ccccgccctc ccgtagagga gcctccaccg 19500 
gccgtggaga cagtgtctcc agaggggcgt ggcgaaaagc gtccgcgccc cgacagggaa 19560 
gaaactctgg tgacgcaaat agacgagcct ccctcgtacg aggaggcact aaagcaaggc 19620 
ctgcccacca cccgtcccat cgcgcccatg gctaccggag tgctgggcca gcacacaccc 19680 
gtaacgctgg acctgcctcc ccccgccgac acccagcaga aacctgtgct gccaggcccg 19740 
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accgccgttg ttgtaacccg tcctagccgc gcgtccctgc gccgcgccgc cagcggtccg 19800 
cgatcgttgc ggcccgtagc cagtggcaac tggcaaagca cactgaacag catcgtgggt 19860 
ctgggggtgc aatccctgaa gcgccgacga tgcttctgaa tagctaacgt gtcgtatgtg 19920 
tgtcatgtat gcgtccatgt cgccgccaga ggagctgctg agccgccgcg cgcccgcttt 19980 
ccaagatggc taccccttcg atgatgccgc agtggtctta catgcacatc tcgggccagg 2 0 040 
acgcctcgga gtacctgagc cccgggctgg tgcagtttgc ccgcgccacc gagacgtact 20100 
tcagcctgaa taacaagttt agaaacccca cggtggcgcc tacgcacgac gtgaccacag 20160 
accggtccca gcgtttgacg ctgcggttca tccctgtgga ccgtgaggat actgcgtact 2022 0 
cgtacaaggc gcggttcacc ctagctgtgg gtgataaccg tgtgctggac atggcttcca 20280 
cgtactttga catccgcggc gtgctggaca ggggccctac ttttaagccc tactctggca 2 0 340 
ctgcctacaa cgccctggct cccaagggtg ccccaaatcc ttgcgaatgg gatgaagctg 20400 
ctactgctct tgaaataaac ctagaagaag aggacgatga caacgaagac gaagtagacg 2 0460 
agcaagctga gcagcaaaaa actcacgtat ttgggcaggc gccttattct ggtataaata 2 0 520 
ttacaaagga gggtattcaa ataggtgtcg aaggtcaaac acctaaatat gccgataaaa 20580 
catttcaacc tgaacctcaa ataggagaat ctcagtggta cgaaactgaa attaatcatg 2 0 640 
cagctgggag agtccttaaa aagactaccc caatgaaacc atgttacggt tcatatgcaa 2 0 700 
aacccacaaa tgaaaatgga gggcaaggca ttcttgtaaa gcaacaaaat ggaaagctag 2 0 760 
aaagtcaagt ggaaatgcaa tttttctcaa ctactgaggc gaccgcaggc aatggtgata 20820 
acttgactcc taaagtggta ttgtacagtg aagatgtaga tatagaaacc ccagacactc 20880 
atatttctta catgcccact attaaggaag gtaactcacg agaactaatg ggccaacaat 20940 
ctatgcccaa caggcctaat tacattgctt ttagggacaa ttttattggt ctaatgtatt 21000 
acaacagcac gggtaatatg ggtgttctgg cgggccaagc atcgcagttg aatgctgttg 21060 
tagatttgca agacagaaac acagagcttt cataccagct tttgcttgat tccattggtg 2112 0 
atagaaccag gtacttttct atgtggaatc aggctgttga cagctatgat ccagatgtta 21180 
gaattattga aaatcatgga actgaagatg aacttccaaa ttactgcttt ccactgggag 21240 
gtgtgattaa tacagagact cttaccaagg taaaacctaa aacaggtcag gaaaatggat 213 0 0 
gggaaaaaga tgctacagaa ttttcagata aaaatgaaat aagagttgga aataattttg 213 6 0 
ccatggaaat caatctaaat gccaacctgt ggagaaattt cctgtactcc aacatagcgc 2142 0 
tgtatttgcc cgacaagcta aagtacagtc cttccaacgt aaaaatttct gataacccaa 2148 0 
acacctacga ctacatgaac aagcgagtgg tggctcccgg gttagtggac tgctacatta 2154 0 
accttggagc acgctggtcc cttgactata tggacaacgt caacccattt aaccaccacc 21600 
gcaatgctgg cctgcgctac cgctcaatgt tgctgggcaa tggtcgctat gtgcccttcc 21660 
acatccaggt gcctcagaag ttctttgcca ttaaaaacct ccttctcctg ccgggctcat 2172 0 
acacctacga gtggaacttc aggaaggatg ttaacatggt tctgcagagc tccctaggaa 2178 0 
atgacctaag ggttgacgga gccagcatta agtttgatag catttgcctt tacgccacct 2184 0 
tcttccccat ggcccacaac accgcctcca cgcttgaggc catgcttaga aacgacacca 2190 0 
acgaccagtc ctttaacgac tatctctccg ccgccaacat gctctaccct atacccgcca 21960 
acgctaccaa cgtgcccata tccatcccct cccgcaactg ggcggctttc cgcggctggg 22 02 0 
ccttcacgcg ccttaagact aaggaaaccc catcactggg ctcgggctac gacccttatt 22 08 0 
acacctactc tggctctata ccctacctag atggaacctt ttacctcaac cacaccfctta 22140 
agaaggtggc cattaccttt gactcttctg tcagctggcc tggcaatgac cgcctgctta 22200 
cccccaacga gtttgaaatt aagcgctcag ttgacgggga gggttacaac gttgcccagt 222 60 
gtaacatgac caaagactgg ttcctggtac aaatgctagc taactacaac attggctacc 22 32 0 
agggcttcta tatcccagag agctacaagg accgcatgta ctccttcttt agaaacttcc 223 80 
agcccatgag ccgtcaggtg gtggatgata ctaaatacaa ggactaccaa caggtgggca 22440 
tcctacacca acacaacaac tctggatttg ttggctacct tgcccccacc atgcgcgaag 2250 0 
gacaggccta ccctgctaac ttcccctatc cgcttatagg caagaccgca gttgacagca 22560 
ttacccagaa aaagtttctt tgcgatcgca ccctttggcg catcccattc tccagtaact 2262 0 
ttatgtccat gggcgcactc acagacctgg gccaaaacct tctctacgcc aactccgccc 22 680 
acgcgctaga catgactttt gaggtggatc ccatggacga gcccaccctt ctttatgttt 22 74 0 
tgtttgaagt ctttgacgtg gtccgtgtgc accggccgca ccgcggcgtc atcgaaaccg 22 80 0 
tgtacctgcg cacgcccttc tcggccggca acgccacaac ataaagaagc aagcaacatc 2 2 860 
aacaacagct gccgccatgg gctccagtga gcaggaactg aaagccattg tcaaagatct 22 92 0 
tggttgtggg ccatattttt tgggcaccta tgacaagcgc tttccaggct ttgtttctcc 22 98 0 
acacaagctc gcctgcgcca tagtcaatac ggccggtcgc gagactgggg gcgtacactg 2 3 040 
gatggccttt gcctggaacc cgcactcaaa aacatgctac ctctttgagc cctttggctt 23100 
ttctgaccag cgactcaagc aggtttacca gtttgagtac gagtcactcc tgcgccgtag 23160 
cgccattgct tcttcccccg accgctgtat aacgctggaa aagtccaccc aaagcgtaca 2322 0 
ggggcccaac tcggccgcct gtggactatt ctgctgcatg tttctccacg cctttgccaa 23280 
ctggccccaa actcccatgg atcacaaccc caccatgaac cttattaccg gggtacccaa 23340 
ctccatgctc aacagtcccc aggtacagcc caccctgcgt cgcaaccagg aacagctcta 23400 
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cagcttcctg gagcgccact cgccctactt ccgcagccac agtgcgcaga ttaggagcgc 2346 0 
cacttctttt tgtcacttga aaaacatgta aaaataatgt actagagaca ctttcaataa 2352 0 
aggcaaatgc ttttatttgt acactctcgg gtgattattt acccccaccc ttgccgtctg 23580 
cgccgtttaa aaatcaaagg ggttctgccg cgcatcgcta tgcgccactg gcagggacac 23 640 
gttgcgatac tggtgtttag tgctccactt aaactcaggc acaaccatcc gcggcagctc 23 70 0 
ggtgaagttt tcactccaca ggctgcgcac catcaccaac gcgtttagca ggtcgggcgc 23760 
cgatatcttg aagtcgcagt tggggcctcc gccctgcgcg cgcgagttgc gatacacagg 23 82 0 
gttgcagcac tggaacacta tcagcgccgg gtggtgcacg ctggccagca cgctcttgtc 23 88 0 
ggagatcaga tccgcgtcca ggtcctccgc gttgctcagg gcgaacggag tcaactttgg 23 940 
tagctgcctt cccaaaaagg gcgcgtgccc aggctttgag ttgcactcgc accgtagtgg 240 0 0 
catcaaaagg tgaccgtgcc cggtctgggc gttaggatac agcgcctgca taaaagcctt 24060 
gatctgctta aaagccacct gagcctttgc gccttcagag aagaacatgc cgcaagactt 2412 0 
gccggaaaac tgattggccg gacaggccgc gtcgtgcacg cagcaccttg cgtcggtgtt 24180 
ggagatctgc accacatttc ggccccaccg gttcttcacg atcttggcct tgctagactg 24240 
ctccttcagc gcgcgctgcc cgttttcgct cgtcacatcc atttcaatca cgtgctcctt 24300 
atttatcata atgcttccgt gtagacactt aagctcgcct tcgatctcag cgcagcggtg 24360 
cagccacaac gcgcagcccg tgggctcgtg atgcttgtag gtcacctctg caaacgactg 24420 
caggtacgcc tgcaggaatc gccccatcat cgtcacaaag gtcttgttgc tggtgaaggt 244 80 
cagctgcaac ccgcggtgct cctcgttcag ccaggtcttg catacggccg ccagagcttc 24540 
cacttggtca ggcagtagtt tgaagttcgc ctttagatcg ttatccacgt ggtacttgtc 24600 
catcagcgcg cgcgcagcct ccatgccctt ctcccacgca gacacgatcg gcacactcag 24660 
cgggttcatc accgtaattt cactttccgc ttcgctgggc tcttcctctt cctcttgcgt 24720 
ccgcatacca cgcgccactg ggtcgtcttc attcagccgc cgcactgtgc gcttacctcc 24780 
tttgccatgc ttgattagca ccggtgggtt gctgaaaccc accatttgta gcgccacatc 24840 
ttctctttct tcctcgctgt ccacgattac ctctggtgat ggcgggcgct cgggcttggg 24900 
agaagggcgc ttctttttct tcttgggcgc aatggccaaa tccgccgccg aggtcgatgg 24960 
ccgcgggctg ggtgtgcgcg gcaccagcgc gtcttgtgat gagtcttcct cgtcctcgga 25020 
ctcgatacgc cgcctcatcc gcttttttgg gggcgcccgg ggaggcggcg gcgacgggga 250 80 
cggggacgac acgtcctcca tggttggggg acgtcgcgcc gcaccgcgtc cgcgctcggg 25140 
ggtggtttcg cgctgctcct cttcccgact ggccatttcc ttctcctata ggcagaaaaa 252 00 
gatcatggag tcagtcgaga agaaggacag cctaaccgcc ccctctgagt tcgccaccac 252 60 
cgcctccacc gatgccgcca acgcgcctac caccttcccc gtcgaggcac ccccgcttga 25320 
ggaggaggaa gtgattatcg agcaggaccc aggttttgta agcgaagacg acgaggaccg 25380 
ctcagtacca acagaggata aaaagcaaga ccaggacaac gcagaggcaa acgaggaaca 25440 
agtcgggcgg ggggacgaaa ggcatggcga ctacctagat gtgggagacg acgtgctgtt 25500 
gaagcatctg cagcgccagt gcgccattat ctgcgacgcg ttgcaagagc gcagcgatgt 25560 
gcccctcgcc atagcggatg tcagccttgc ctacgaacgc cacctattct caccgcgcgt 25620 
accccccaaa cgccaagaaa acggcacatg cgagcccaac ccgcgcctca acttctaccc 25680 
cgtatttgcc gtgccagagg tgcttgccac ctatcacatc tttttccaaa actgcaagat 25740 
acccctatcc tgccgtgcca accgcagccg agcggacaag cagctggcct tgcggcaggg 258 00 
cgctgtcata cctgatatcg cctcgctcaa cgaagtgcca aaaatctttg agggtcttgg 25860 
acgcgacgag aagcgcgcgg caaacgctct gcaacaggaa aacagcgaaa atgaaagtca 2 5 920 
ctctggagtg ttggtggaac tcgagggtga caacgcgcgc ctagccgtac taaaacgcag 25 98 0 
catcgaggtc acccactttg cctacccggc acttaaccta ccccccaagg tcatgagcac 26040 
agtcatgagt gagctgatcg tgcgccgtgc gcagcccctg gagagggatg caaatttgca 2610 0 
agaacaaaca gaggagggcc tacccgcagt tggcgacgag cagctagcgc gctggcttca 26160 
aacgcgcgag cctgccgact tggaggagcg acgcaaacta atgatggccg cagtgctcgt 26220 
taccgtggag cttgagtgca tgcagcggtt ctttgctgac ccggagatgc agcgcaagct 2 6280 
agaggaaaca ttgcactaca cctttcgaca gggctacgta cgccaggcct gcaagatctc 2 6340 
caacgtggag ctctgcaacc tggtctccta ccttggaatt ttgcacgaaa accgccttgg 2640 0 
gcaaaacgtg cttcattcca cgctcaaggg cgaggcgcgc cgcgactacg tccgcgactg 2 6460 
cgtttactta tttctatgct acacctggca gacggccatg ggcgtttggc agcagtgctt 26520 
ggaggagtgc aacctcaagg agctgcagaa actgctaaag caaaacttga aggacctatg 26580 
gacggccttc aacgagcgct ccgtggccgc gcacctggcg gacatcattt tccccgaacg 2 6640 
cctgcttaaa accctgcaac agggtctgcc agacttcacc agtcaaagca tgttgcagaa 26700 
ctttaggaac tttatcctag agcgctcagg aatcttgccc gccacctgct gtgcacttcc 26760 
tagcgacttt gtgcccatta agtaccgcga atgccctccg ccgctttggg gccactgcta 2682 0 
ccttctgcag ctagccaact accttgccta ccactctgac ataatggaag acgtgagcgg 26880 
tgacggtcta ctggagtgtc actgtcgctg caacctatgc accccgcacc gctccctggt 2 694 0 
ttgcaattcg cagctgctta acgaaagtca aattatcggt acctttgagc tgcagggtcc 270 0 0 
ctcgcctgac gaaaagtccg cggctccggg gttgaaactc actccggggc tgtggacgtc 2 7060 
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ggcttacctt cgcaaatttg tacctgagga ctaccacgcc cacgagatta ggttctacga 2712 0 
agaccaatcc cgcccgccaa atgcggagct taccgcctgc gtcattaccc agggccacat 27180 
tcttggccaa ttgcaagcca tcaacaaagc ccgccaagag tttctgctac gaaagggacg 27240 
gggggtttac ttggaccccc agtccggcga ggagctcaac ccaatccccc cgccgccgca 2 7300 
gccctatcag cagcagccgc gggcccttgc ttcccaggat ggcacccaaa aagaagctgc 2 7360 
agctgccgcc gccacccacg gacgaggagg aatactggga cagtcaggca gaggaggttt 2 7420 
tggacgagga ggaggaggac atgatggaag actgggagag cctagacgag gaagcttccg 2 7480 
aggtcgaaga ggtgtcagac gaaacaccgt caccctcggt cgcattcccc tcgccggcgc 2 754 0 
cccagaaatc ggcaaccggt tccagcatgg ctacaacctc cgctcctcag gcgccgccgg 276 0 0 
cactgcccgt tcgccgaccc aaccgtagat gggacaccac tggaaccagg gccggtaagt 27660 
ccaagcagcc gccgccgtta gcccaagagc aacaacagcg ccaaggctac cgctcatggc 27720 
gcgggcacaa gaacgccata gttgcttgct tgcaagactg tgggggcaac atctccttcg 27780 
cccgccgctt tcttctctac catcacggcg tggccttccc ccgtaacatc ctgcattact 27840 
accgtcatct ctacagccca tactgcaccg gcggcagcgg cagcggcagc aacagcagcg 27900 
gccacacaga agcaaaggcg accggatagc aagactctga caaagcccaa gaaatccaca 27960 
gcggcggcag cagcaggagg aggagcgctg cgtctggcgc ccaacgaacc cgtatcgacc 2 8 020 
cgcgagctta gaaacaggat ttttcccact ctgtatgcta tatttcaaca gagcaggggc 2 80 80 
caagaacaag agctgaaaat aaaaaacagg tctctgcgat ccctcacccg cagctgcctg 2 814 0 
tatcacaaaa gcgaagatca gcttcggcgc acgctggaag acgcggaggc tctcttcagt 2 8200 
aaatactgcg cgctgactct taaggactag tttcgcgccc tttctcaaat ttaagcgcga 2 82 60 
aaactacgtc atctccagcg gccacacccg gcgccagcac ctgtcgtcag cgccattatg 28320 
agcaaggaaa ttcccacgcc ctacatgtgg agttaccagc cacaaatggg acttgcggct 28380 
ggagctgccc aagactactc aacccgaata aactacatga gcgcgggacc ccacatgata 2844 0 
tcccgggtca acggaatccg cgcccaccga aaccgaattc tcttggaaca ggcggctatt 2 8500 
accaccacac ctcgtaataa ccttaatccc cgtagttggc ccgctgccct ggtgtaccag 28560 
gaaagtcccg ctcccaccac tgtggtactt cccagagacg cccaggccga agttcagatg 2 8 620 
actaactcag gggcgcagct tgcgggcggc tttcgtcaca gggtgcggtc gcccgggcag 28680 
ggtataactc acctgacaat cagagggcga ggtattcagc tcaacgacga gtcggtgagc 2 874 0 
tcctcgcttg gtctccgtcc ggacgggaca tttcagatcg gcggcgccgg ccgtccttca 28800 
ttcacgcctc gtcaggcaat cctaactctg cagacctcgt cctctgagcc gcgctctgga 28860 
ggcattggaa ctctgcaatt tattgaggag tttgtgccat cggtctactt taaccccttc 28920 
tcgggacctc ccggccacta tccggatcaa tttattccta actttgacgc ggtaaaggac 28980 
tcggcggacg gctacgactg aatgttaagt ggagaggcag agcaactgcg cctgaaacac 29040 
ctggtccact gtcgccgcca caagtgcttt gcccgcgact ccggtgagtt ttgctacttt 2 910 0 
gaattgcccg aggatcatat cgagggcccg gcgcacggcg tccggcttac cgcccaggga 2 9160 
gagcttgccc gtagcctgat tcgggagttt acccagcgcc ccctgctagt tgagcgggac 2 9220 
aggggaccct gtgttctcac tgtgatttgc aactgtccta accttggatt acatcaagat 2 9280 
ttaattaatt gccacatcct cttacacttt ttcatacatt gcccaagaat aaagaatcgt 2 9340 
ttgtgttatg tttcaacgtg tttatttttc aattgcagaa aatttcaagt catttttcat 29400 
tcagtagtat agccccacca ccacatagct tatacagatc accgtacctt aatcaaactc 29460 
acagaaccct agtattcaac ctgccacctc cctcccaaca cacagagtac acagtccttt 29520 
ctccccggct ggccttaaaa agcatcatat catgggtaac agacatattc ttaggtgtta 2 9580 
tattccacac ggtttcctgt cgagccaaac gctcatcagt gatattaata aactccccgg 29640 
gcagctcact taagttcatg tcgctgtcca gctgctgagc cacaggctgc tgtccaactt 29700 
gcggttgctt aacgggcggc gaaggagaag tccacgccta catgggggta gagtcataat 2 9760 
cgtgcatcag gatagggcgg tggtgctgca gcagcgcgcg aataaactgc tgccgccgcc 2 9820 
gctccgtcct gcaggaatac aacatggcag tggtctcctc agcgatgatt cgcaccgccc 29880 
gcagcataag gcgccttgtc ctccgggcac agcagcgcac cctgatctca cttaaatcag 29940 
cacagtaact gcagcacagc accacaatat tgttcaaaat cccacagtgc aaggcgctgt 300 00 
atccaaagct catggcgggg accacagaac ccacgtggcc atcataccac aagcgcaggt 3 00 60 
agattaagtg gcgacccctc ataaacacgc tggacataaa cattacctct tttggcatgt 30120 
tgtaattcac cacctcccgg taccatataa acctctgatt aaacatggcg ccatccacca 30180 
ccatcctaaa ccagctggcc aaaacctgcc cgccggctat acactgcagg gaaccgggac 30240 
tggaacaatg acagtggaga gcccaggact cgtaaccatg gatcatcatg ctcgtcatga 3 03 00 
tatcaatgtt ggcacaacac aggcacacgt gcatacactt cctcaggatt acaagctcct 3 03 60 
cccgcgttag aaccatatcc cagggaacaa cccattcctg aatcagcgta aatcccacac 3 042 0 
tgcagggaag acctcgcacg taactcacgt tgtgcattgt caaagtgtta cattcgggca 30480 
gcagcggatg atcctccagt atggtagcgc gggtttctgt ctcaaaagga ggtagacgat 30540 
ccctactgta cggagtgcgc cgagacaacc gagatcgtgt tggtcgtagt gtcatgccaa 3 0 600 
atggaacgcc ggacgtagtc atatttcctg aagcaaaacc aggtgcgggc gtgacaaaca 30660 
gatctgcgtc tccggtctcg ccgcttagat cgctctgtgt agtagttgta gtatatccac 30720 
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tctctcaaag 
gctgccctga 
ttctgcgagt 
ttccaaaaga 
ggtggcgtgg 
aatggcttcc 
agggtgaatc 
tcgccacctt 
aatctgctcc 
tcaggttcct 
tcccgtaggt 
gcggccactt 
ctcggagcta 
aaaatgcaag 
gtagtcatgc 
catttttctc 
aacatttaaa 
cggactacgg 
accgacagct 
tgattcatcg 
gcgtagagac 
cacataaaca 
aacatacagc 
tattaaaaaa 
caagtgcaga 
aacacccaga 
ttcctcaaat 
caattcccaa 
cgccccgcgc 
aaggtatatt 



catccaggcg 
taacatccac 
cacacacggg 
ttatccaaaa 
tcaaactcta 
aaaaggcaaa 
tcctctataa 
ctcaatatat 
agagcgccct 
cacagacctg 
cccttcgcag 
ccccgccagg 
tgctaaccag 
gtgctgctca 
tcatgcagat 
tcaaacatgt 
cattagaagc 
ccatgccggc 
cctcggtcat 
gtcagtgcta 
aacattacag 
cctgaaaaac 
gcttcacagc 
acaccactcg 
gcgagtatat 
aaaccgcacg 
cgtcacttcc 
cacatacaag 
cacgtcacaa 
attgatgatg 



ccccctggct 
caccgcagaa 
aggagcggga 
cctcaaaatg 
cagccaaaga 
cggccctcac 
acattccagc 
ctctaagcaa 
ccaccttcag 
tataagattc 
ggccagctga 
aaccttgaca 
cgtagccccg 
aaaaatcagg 
aaaggcaggt 
ctgcgggttt 
ctgtcttaca 
gtgaccgtaa 
gtccggagtc 
aaaagcgacc 
cccccatagg 
cctcctgcct 
ggcagcctaa 
acacggcacc 
ataggactaa 
cgaacctacg 
gttttcccac 
ttactccgcc 
actccacccc 



tcgggttcta 
taagccacac 
agagctggaa 
aagatctatt 
acagataatg 
gtccaagtgg 
accttcaacc 
atcccgaata 
cctcaagcag 
aaaagcggaa 
acataatcgt 
aaagaaccca 
atgtaagctt 
caaagcctcg 
aagctccgga 
ctgcataaac 
acaggaaaaa 
aaaaactggt 
ataatgtaag 
gaaatagccc 
aggtataaca 
aggcaaaata 
cagtcagcct 
agctcaatca 
aaaatgacgt 
cccagaaacg 
gttacgtaac 
ctaaaaccta 
ctcattatca 



tgtaaactcc 
ccagccaacc 
gaaccatgtt 
aagtgaacgc 
gcatttgtaa 
acgtaaaggc 
atgcccaaat 
ttaagtccgg 
cgaatcatga 
cattaacaaa 
gcaggtctgc 
cactgattat 
tgttgcatgg 
cgcaaaaaag 
accaccacag 
acaaaataaa 
caacccttat 
caccgtgatt 
actcggtaaa 

gggggaatac 

aaattaatag 
gcaccctccc 
taccagtaaa 
gtcacagtgt 
aacggttaaa 
aaagccaaaa 
ttcccatttt 
cgtcacccgc 
tattggcttc 



ttcatgcgcc 
tacacattcg 
ttttttttta 
gctcccctcc 
gatgttgcac 
taaacccttc 
aattctcatc 
ccattgtaaa 
ttgcaaaaat 
aataccgcga 
acggaccagc 
gacacgcata 
gcggcgatat 
aaagcacatc 
aaaaagacac 
ataacaaaaa 
aagcataaga 
aaaaagcacc 
cacatcaggt 
atacccgcag 
gagagaaaaa 
gctccagaac 
aaagaaaacc 
aaaaaagggc 
gtccacaaaa 
aacccacaac 
aagaaaacta 
cccgttccca 
aatccaaaat 



<210> 107 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<2 23> Linker Sequence 
<400> 107 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly 
15 10 



30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32480 



Ser 



<210> 108 
<211> 8383 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid pDV60 
<400> 108 

gacggatcgg gagatctccc gatcccctat 
ccgcatagtt aagccagtat ctgctccctg 
cgagcaaaat ttaagctaca acaaggcaag 
ttagggttag gcgttttgcg ctgcttcgcg 
gattattgac tagttattaa tagtaatcaa 
tggagttccg cgttacataa cttacggtaa 
cccgcccatt gacgtcaata atgacgtatg 
attgacgtca atgggtggac tatttacggt 



ggtcgactct cagtacaatc tgctctgatg 60 
cttgtgtgtt ggaggtcgct gagtagtgcg 12 0 
gcttgaccga caattgcatg aagaatctgc 18 0 
atgtacgggc cagatatacg cgttgacatt 24 0 
ttacggggtc attagttcat agcccatata 30 0 
atggcccgcc tggctgaccg cccaacgacc 3 60 
ttcccatagt aacgccaata gggactttcc 42 0 
aaactgccca cttggcagta catcaagtgt 48 0 
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atcatatgcc aagtacgccc cctattgacg 
atgcccagta catgacctta tgggactttc 
tcgctattac catggtgatg cggttttggc 
actcacgggg atttccaagt ctccacccca 
aaaatcaacg ggactttcca aaatgtcgta 
gtaggcgtgt acggtgggag gtctatataa 
ctgcttactg gcttatcgaa attaatacga 
gagctcggat ccactctctt ccgcatcgct 
ctccctctga aaagcgggca tgacttctgc 
ggatttgata ttcacctggc ccgcggtgat 
agaaaagaca atctttttgt tgtcaagctt 
cagcaacttg gcgatggagc gcagggtttg 
cgcgatgttt agctgcacgt attcgcgcgc 
gcgctcgtcg ggcaccaggt gcacgcgcca 
gctggtggct acctctccgc gtaggcgctc 
cgagcagaat ggcggtaggg ggtctagctg 
aaagaccccg ggcagcaggc gcgcgtcgaa 
cgcctgctgc catgcgcggg cggcaagcgc 
tggcatgggg tgggtgagcg cggaggcgta 
ctctctgagt attccaagat atgtagggta 
gtaatcgtat agttcgtgcg agggagcgag 
ctgctctgct cggaagacta tctgcctgaa 
acgctggaag acgttgaagc tggcgtctgt 
gtaggagtcg cgcagcttgt tgaccagctc 
gtccagggtt tccttgatga tgtcatactt 
gttgaggaca aactcttcgc ggtctttcca 
cgaacgagat ccgtactccg ccgccgaggg 
aaaacctctc gagaaaggcg tctaaccagt 
aagaccgtct gaagatacct tcaaccccgt 
aactgtgcct tttcttactc ctccctttgt 
tggggtactc tctttgcgcc tatccgaacc 
caaaatgggc aacggcctct ctctggacga 
cactgtgagc ccacctctca aaaaaaccaa 
cctcacagtt acctcagaag ccctaactgt 
caacacactc accatgcaat cacaggcccc 
tgccacccaa ggacccctca cagtgtcaga 
cctcaccacc accgatagca gtacccttac 
cactggtagc ttgggcattg acttgaaaga 
actaaagtac ggggctcctt tgcatgtaac 
tggtccaggt gtgactatta ataatacttc 
ttttgattca caaggcaata tgcaacttaa 
aaacagacgc cttatacttg atgttagtta 
aagactagga cagggccctc tttttataaa 
caaaggcctt tacttgttta cagcttcaaa 
cactgccaag gggttgatgt ttgacgctac 
tgaatttggt tcacctaatg caccaaacac 
cctagaattt gattcaaaca aggctatggt 
cagcacaggt gccattacag taggaaacaa 
accagctcca tctcctaact gtagactaaa 
cttaacaaaa tgtggcagtc aaatacttgc 
tttggctcca atatctggaa cagttcaaag 
tggagtgcta ctaaacaatt ccttcctgga 
tcttactgaa ggcacagcct atacaaacgc 
tccaaaatct cacggtaaaa ctgccaaaag 
agacaaaact aaacctgtaa cactaaccat 
cacaactcca agtgcatact ctatgtcatt 
taatgaaata tttgccacat cctcttacac 
gcggccgctc gagcatgcat ctagagggcc 
ctcgctgatc agcctcgact gtgccttcta 
ccgtgccttc cttgaccctg gaaggtgcca 
aaattgcatc gcattgtctg agtaggtgtc 



tcaatgacgg taaatggccc gcctggcatt 540 
ctacttggca gtacatctac gtattagtca 600 
agtacatcaa tgggcgtgga tagcggtttg 660 
ttgacgtcaa tgggagtttg ttttggcacc 72 0 
acaactccgc cccattgacg caaatgggcg 780 
gcagagctct ctggctaact agagaaccca 84 0 
ctcactatag ggagacccaa gcttggtacc 900 
gtctgcgagg gccagctgtt ggggtgagta 960 
gctaagattg tcagtttcca aaaacgagga 102 0 
gcctttgagg gtggccgcat ccatctggtc 10 80 
ggtggcaaac gacccgtaga gggcgttgga 114 0 
gtttttgtcg cgatcggcgc gctccttggc 12 0 0 
aacgcaccgc cattcgggaa agacggtggt 12 60 
accgcggttg tgcagggtga caaggtcaac 132 0 
gttggtccag cagaggcggc cgcccttgcg 13 80 
cgtctcgtcc ggggggtctg cgtccacggt 144 0 
gtagtctatc ttgcatcctt gcaagtctag 1500 
gcgctcgtat gggttgagtg ggggacccca 1560 
catgccgcaa atgtcgtaaa cgtagagggg 162 0 
gcatcttcca ccgcggatgc tggcgcgcac 16 8 0 
gaggtcggga ccgaggttgc tacgggcggg 1740 
gatggcatgt gagttggatg atatggttgg 18 0 0 
gagacctacc gcgtcacgca cgaaggaggc 1860 
ggcggtgacc tgcacgtcta gggcgcagta 192 0 
atcctgtccc ttttttttcc acagctcgcg 1980 
gtactcttgg atcggaaacc cgtcggcctc 2040 
acctgagcga gtccgcatcg accggatcgg 210 0 
cacagtcgca agatccaaga tgaagcgcgc 2160 
gtatccatat gacacggaaa ccggtcctcc 222 0 
atcccccaat gggtttcaag agagtccccc 22 80 
tctagttacc tccaatggca tgcttgcgct 2340 
ggccggcaac cttacctccc aaaatgtaac 24 0 0 
gtcaaacata aacctggaaa tatctgcacc 2460 
ggctgccgcc gcacctctaa tggtcgcggg 2 52 0 
gctaaccgtg cacgactcca aacttagcat 2580 
aggaaagcta gccctgcaaa catcaggccc 264 0 
tatcactgcc tcaccccctc taactactgc 2700 
gcccatttat acacaaaatg gaaaactagg 2760 
agacgaccta aacactttga ccgtagcaac 2820 
cttgcaaact aaagttactg gagccttggg 2880 
tgtagcagga ggactaagga ttgattctca 294 0 
tccgtttgat gctcaaaacc aactaaatct 3 00 0 
ctcagcccac aacttggata ttaactacaa 3 060 
caattccaaa aagcttgagg ttaacctaag 3120 
agccatagcc attaatgcag gagatgggct 3180 
aaatcccctc aaaacaaaaa ttggccatgg 3240 
tcctaaacta ggaactggcc ttagttttga 3300 
aaataatgat aagctaactt tgtggaccac 3360 
tgcagagaaa gatgctaaac tcactttggt 342 0 
tacagtttca gttttggctg ttaaaggcag 3480 
tgctcatctt attataagat ttgacgaaaa 3540 
cccagaatat tggaacttta gaaatggaga 36 0 0 
tgttggattt atgcctaacc tatcagctta 3 660 
taacattgtc agtcaagttt acttaaacgg 3 72 0 
tacactaaac ggtacacagg aaacaggaga 3 7 80 
ttcatgggac tggtctggcc acaactacat 3 84 0 
tttttcatac attgcccaag aataaaagaa 3900 
ctattctata gtgtcaccta aatgctagag 3960 
gttgccagcc atctgttgtt tgcccctccc 4020 
ctcccactgt cctttcctaa taaaatgagg 40 80 
attctattct ggggggtggg gtggggcagg 414 0 
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acagcaaggg ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta 420 0 
tggcttctga ggcggaaaga accagctggg gctctagggg gtatccccac gcgccctgta 42 60 
gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca 432 0 
gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg fctcgccggct 4380 
ttccccgtca agctctaaat cggggcatcc ctttagggtt ccgatttagt gctttacggc 444 0 
acctcgaccc caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat 45 0 0 
agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc 4560 
aaactggaac aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgg 4620 
ggatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaattaat 4 68 0 
tctgtggaat gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag gcaggcagaa 474 0 
gtatgcaaag catgcatctc aattagtcag caaccaggtg tggaaagtcc ccaggctccc 4 80 0 
cagcaggcag aagtatgcaa agcatgcatc tcaattagtc agcaaccata gtcccgcccc 4 86 0 
taactccgcc catcccgccc ctaactccgc ccagttccgc ccattctccg ccccatggct 4920 
gactaatttt ttttatttat gcagaggccg aggccgcctc tgcctctgag ctattccaga 4980 
agtagtgagg aggctttttt ggaggcctag gcttttgcaa aaagctcccg ggagcttgta 5040 
tatccatttt cggatctgat caagagacag gatgaggatc gtttcgcatg attgaacaag 5100 
atggattgca cgcaggttct ccggccgctt gggtggagag gctattcggc tatgactggg 516 0 
cacaacagac aatcggctgc tctgatgccg ccgtgttccg gctgtcagcg caggggcgcc 522 0 
cggttctttt tgtcaagacc gacctgtccg gtgccctgaa tgaactgcag gacgaggcag 52 80 
cgcggctatc gtggctggcc acgacgggcg ttccttgcgc agctgtgctc gacgttgtca 5340 
ctgaagcggg aagggactgg ctgctattgg gcgaagtgcc ggggcaggat ctcctgtcat 54 0 0 
ctcaccttgc tcctgccgag aaagtatcca tcatggctga tgcaatgcgg cggctgcata 546 0 
cgcttgatcc ggctacctgc ccattcgacc accaagcgaa acatcgcatc gagcgagcac 552 0 
gtactcggat ggaagccggt cttgtcgatc aggatgatct ggacgaagag catcaggggc 55 8 0 
tcgcgccagc cgaactgttc gccaggctca aggcgcgcat gcccgacggc gaggatctcg 5 64 0 
tcgtgaccca tggcgatgcc tgcttgccga atatcatggt ggaaaatggc cgcttttctg 5700 
gattcatcga ctgtggccgg ctgggtgtgg cggaccgcta tcaggacata gcgttggcta 57 6 0 
cccgtgatat tgctgaagag cttggcggcg aatgggctga ccgcttcctc gtgctttacg 5 82 0 
gtatcgccgc tcccgattcg cagcgcatcg ccttctatcg ccttcttgac gagttcttct 5880 
gagcgggact ctggggttcg aaatgaccga ccaagcgacg cccaacctgc catcacgaga 5 94 0 
tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt tccgggacgc 6 00 0 
cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc accccaactt 6060 
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 612 0 
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 6180 
tgtctgtata ccgtcgacct ctagctagag cttggcgtaa tcatggtcat agctgtttcc 624 0 
tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 63 0 0 
taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 63 6 0 
cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 642 0 
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 6480 
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 654 0 
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 6 60 0 
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 666 0 
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 6 72 0 
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 6780 
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct caatgctcac gctgtaggta 6 84 0 
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 690 0 
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 696 0 
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 7 02 0 
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 7 0 80 
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 714 0 
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 720 0 
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 72 60 
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 7 32 0 
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 73 8 0 
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 744 0 
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 75 0 0 
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 7560 
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 762 0 
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 7 68 0 
gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 774 0 
ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 7800 
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aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 7 86 0 
atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 7 92 0 
cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 7980 
gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 8 04 0 
agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 810 0 
gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 816 0 
caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 822 0 
ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 8280 
tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 834 0 
aggggttccg cgcacatttc cccgaaaagt gccacctgac gtc 8383 

<210> 109 
<211> 7960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid pDV67 
<400> 109 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 12 0 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 180 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 240 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 3 00 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 420 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 4 80 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 72 0 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 840 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 90 0 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 102 0 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 1080 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 114 0 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 12 0 0 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 126 0 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 1320 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 13 80 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 144 0 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 150 0 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 162 0 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 168 0 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 174 0 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 18 0 0 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 186 0 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 1920 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 210 0 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 
caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 222 0 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 2280 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 2400 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 2460 
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taaacctgga 
ccgcacctct 
tgcacgactc 
tagccctgca 
cctcaccccc 
atacacaaaa 
taaacacttt 
ctaaagttac 
gaggactaag 
atgctcaaaa 
acaacttgga 
aaaagcttga 
ccattaatgc 
tcaaaacaaa 
taggaactgg 
ataagctaac 
aagatgctaa 
cagttttggc 
ttattataag 
attggaactt 
ttatgcctaa 
tcagtcaagt 
acggtacaca 
actggtctgg 
acattgccca 
gatcagcctc 
cttccttgac 
catcgcattg 
agggggagga 
ctgaggcgga 
cattaagcgc 
tagcgcccgc 
gtcaagctct 
accccaaaaa 
tttttcgccc 
gaacaacact 
cggcctattg 
gaatgtgtgt 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
ttttcggatc 
ataatacgac 
caccgcgcgc 
ggacttcgtg 
cgcggtccag 
ggacgagctg 
gccggccatg 
ggccggcaac 
cgattccacc 
ctggatgatc 
tattgcagct 
atttttttca 
ctgtataccg 
gtgaaattgt 
agcctggggt 
tttccagtcg 
aggcggtttg 
cgttcggctg 



aatatctgca 
aatggtcgcg 
caaacttagc 
aacatcaggc 
tctaactact 
tggaaaacta 
gaccgtagca 
tggagccttg 
gattgattct 
ccaactaaat 
tattaactac 
ggttaaccta 
aggagatggg 
aattggccat 
ccttagtttt 
tttgtggacc 
actcactttg 
tgttaaaggc 
atttgacgaa 
tagaaatgga 
cctatcagct 
ttacttaaac 
ggaaacagga 
ccacaactac 
agaataaaag 
gactgtgcct 
cctggaaggt 
tctgagtagg 
ttgggaagac 
aagaaccagc 
ggcgggtgtg 
tcctttcgct 
aaatcggggc 
acttgattag 
tttgacgttg 
caaccctatc 
gttaaaaaat 
cagttagggt 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
tgatcagcac 
aaggtgagga 
gacgtcgccg 
gaggacgact 
gaccaggtgg 
tacgccgagt 
accgagatcg 
tgcgtgcact 
gccgccttct 
ctccagcgcg 
tataatggtt 
ctgcattcta 
tcgacctcta 
tatccgctca 
gcctaatgag 
ggaaacctgt 
cgtattgggc 
cggcgagcgg 



cccctcacag 
ggcaacacac 
attgccaccc 
cccctcacca 
gccactggta 
ggactaaagt 
actggtccag 
ggttttgatt 
caaaacagac 
ctaagactag 
aacaaaggcc 
agcactgcca 
cttgaatttg 
ggcctagaat 
gacagcacag 
acaccagctc 
gtcttaacaa 
agtttggctc 
aatggagtgc 
gatcttactg 
tatccaaaat 
ggagacaaaa 
gacacaactc 
attaatgaaa 
aagcggccgc 
tctagttgcc 
gccactccca 
tgtcattcta 
aatagcaggc 
tggggctcta 
gtggttacgc 
ttcttccctt 
atccctttag 
ggtgatggtt 
gagtccacgt 
tcggtctatt 
gagctgattt 
gtggaaagtc 
tcagcaacca 
catctcaatt 
ccgcccagtt 
gccgaggccg 
ctaggctttt 
gtgttgacaa 
actaaaccat 
gagcggtcga 
tcgccggtgt 
tgccggacaa 
ggtcggaggt 
gcgagcagcc 
tcgtggccga 
atgaaaggtt 
gggatctcat 
acaaataaag 
gttgtggttt 
gctagagctt 
caattccaca 
tgagctaact 
cgtgccagct 
gctcttccgc 
tatcagctca 



ttacctcaga 
tcaccatgca 
aaggacccct 
ccaccgatag 
gcttgggcat 
acggggctcc 
gtgtgactat 
cacaaggcaa 
gccttatact 
gacagggccc 
tttacttgtt 
aggggttgat 
gttcacctaa 
ttgattcaaa 
gtgccattac 
catctcctaa 
aatgtggcag 
caatatctgg 
tactaaacaa 
aaggcacagc 
ctcacggtaa 
ctaaacctgt 
caagtgcata 
tatttgccac 
tcgagtctag 
agccatctgt 
ctgtcctttc 
ttctgggggg 
atgctgggga 

gggggtatcc 

gcagcgtgac 
cctttctcgc 
ggttccgatt 
cacgtagtgg 
tctttaatag 
cttttgattt 
aacaaaaatt 
cccaggctcc 
ggtgtggaaa 
agtcagcaac 
ccgcccattc 
cctctgcctc 
gcaaaaagct 
ttaatcatcg 
ggccaagttg 
gttctggacc 
ggtccgggac 
caccctggcc 
cgtgtccacg 
gtgggggcgg 
ggagcaggac 
gggcttcgga 
gctggagttc 
caatagcatc 
gtccaaactc 
ggcgtaatca 
caacatacga 
cacattaatt 
gcattaatga 
ttcctcgctc 
ctcaaaggcg 



agccctaact 
atcacaggcc 
cacagtgtca 
cagtaccctt 
tgacttgaaa 
tttgcatgta 
taataatact 
tatgcaactt 
tgatgttagt 
tctttttata 
tacagcttca 
gtttgacgct 
tgcaccaaac 
caaggctatg 
agtaggaaac 
ctgtagacta 
tcaaatactt 
aacagttcaa 
ttccttcctg 
ctatacaaac 
aactgccaaa 
aacactaacc 
ctctatgtca 
atcctcttac 
agggcccgtt 
tgtttgcccc 
ctaataaaat 
tggggtgggg 
tgcggtgggc 
ccacgcgccc 
cgctacactt 
cacgttcgcc 
tagtgcttta 
gccatcgccc 
tggactcttg 
ataagggatt 
taacgcgaat 
ccaggcaggc 
gtccccaggc 
catagtcccg 
tccgccccat 
tgagctattc 
cccgggagct 
gcatagtata 
accagtgccg 
gaccggctcg 
gacgtgaccc 
tgggtgtggg 
aacttccggg 
gagttcgccc 
tgacacgtgc 
atcgttttcc 
ttcgcccacc 
acaaatttca 
atcaatgtat 
tggtcatagc 
gccggaagca 
gcgttgcgct 
atcggccaac 
actgactcgc 
gtaatacggt 



gtggctgccg 
ccgctaaccg 
gaaggaaagc 
actatcactg 
gagcccattt 
acagacgacc 
tccttgcaaa 
aatgtagcag 
tatccgtttg 
aactcagccc 
aacaattcca 
acagccatag 
acaaatcccc 
gttcctaaac 
aaaaataatg 
aatgcagaga 
gctacagttt 
agtgctcatc 
gacccagaat 
gctgttggat 
agtaacattg 
attacactaa 
ttttcatggg 
actttttcat 
taaacccgct 
tcccccgtgc 
gaggaaattg 
caggacagca 
tctatggctt 
tgtagcggcg 
gccagcgccc 
ggctttcccc 
cggcacctcg 
tgatagacgg 
ttccaaactg 
ttggggattt 
taattctgtg 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
tgtatatcca 
tcggcatagt 
ttccggtgct 
ggttctcccg 
tgttcatcag 
tgcgcggcct 
acgcctccgg 
tgcgcgaccc 
tacgagattt 
gggacgccgg 
ccaacttgtt 
caaataaagc 
cttatcatgt 
tgtttcctgt 
taaagtgtaa 
cactgcccgc 
gcgcggggag 
tgcgctcggt 
tatccacaga 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
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P^caggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 618 0 
paaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 6240 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 63 0 0 
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 6360 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct 6420 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 6480 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 6540 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 660 0 
tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 6660 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 672 0 
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 6780 
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 6840 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 6 90 0 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 6960 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 7020 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 70 80 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 714 0 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 720 0 
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 726 0 
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 7320 
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 7380 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 744 0 
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 7500 
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 7560 
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 7620 
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 7680 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 7740 
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 7 80 0 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 7860 
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 7 92 0 
ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 7 96 0 



<210> 110 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pGEM5TS3H forward primer 
<400> 110 

atgggatcca agatgaagcg cgcaagaccg 

<210> 111 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



30 



<220> 

<223> pGEM5TS3H reverse primer 
<400> 111 

cactatagcg gccgcattct cagtcatctt 3 0 

<210> 112 
<211> 7989 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid pDV69 
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<221> misc_f eature 
<222> 4242,4245 
<223> n = A,T,C or G 

<400> 112 

gacggatcgg gagatctccc gatcccctat ggtcgactct cagtacaatc tgctctgatg 60 
ccgcatagtt aagccagtat ctgctccctg cttgtgtgtt ggaggtcgct gagtagtgcg 12 0 
cgagcaaaat ttaagctaca acaaggcaag gcttgaccga caattgcatg aagaatctgc 18 0 
ttagggttag gcgttttgcg ctgcttcgcg atgtacgggc cagatatacg cgttgacatt 24 0 
gattattgac tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata 30 0 
tggagttccg cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc 360 
cccgcccatt gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc 42 0 
attgacgtca atgggtggac tatttacggt aaactgccca cttggcagta catcaagtgt 48 0 
atcatatgcc aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt 54 0 
atgcccagta catgacctta tgggactttc ctacttggca gtacatctac gtattagtca 600 
tcgctattac catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg 660 
actcacgggg atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc 720 
aaaatcaacg ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg 780 
gtaggcgtgt acggtgggag gtctatataa gcagagctct ctggctaact agagaaccca 84 0 
ctgcttactg gcttatcgaa attaatacga ctcactatag ggagacccaa gctggctagc 90 0 
gtttaaactt aagcttggta ccgagctcgg atccactctc ttccgcatcg ctgtctgcga 960 
gggccagctg ttggggtgag tactccctct gaaaagcggg catgacttct gcgctaagat 102 0 
tgtcagtttc caaaaacgag gaggatttga tattcacctg gcccgcggtg atgcctttga 10 80 
gggtggccgc atccatctgg tcagaaaaga caatcttttt gttgtcaagc ttggtggcaa 114 0 
acgacccgta gagggcgttg gacagcaact tggcgatgga gcgcagggtt tggtttttgt 12 00 
cgcgatcggc gcgctccttg gccgcgatgt ttagctgcac gtattcgcgc gcaacgcacc 12 60 
gccattcggg aaagacggtg gtgcgctcgt cgggcaccag gtgcacgcgc caaccgcggt 13 2 0 
tgtgcagggt gacaaggtca acgctggtgg ctacctctcc gcgtaggcgc tcgttggtcc 13 8 0 
agcagaggcg gccgcccttg cgcgagcaga atggcggtag ggggtctagc tgcgtctcgt 144 0 
ccggggggtc tgcgtccacg gtaaagaccc cgggcagcag gcgcgcgtcg aagtagtcta 15 00 
tcttgcatcc ttgcaagtct agcgcctgct gccatgcgcg ggcggcaagc gcgcgctcgt 1560 
atgggttgag tgggggaccc catggcatgg ggtgggtgag cgcggaggcg tacatgccgc 162 0 
aaatgtcgta aacgtagagg ggctctctga gtattccaag atatgtaggg tagcatcttc 16 80 
caccgcggat gctggcgcgc acgtaatcgt atagttcgtg cgagggagcg aggaggtcgg 1740 
gaccgaggtt gctacgggcg ggctgctctg ctcggaagac tatctgcctg aagatggcat 18 0 0 
gtgagttgga tgatatggtt ggacgctgga agacgttgaa gctggcgtct gtgagaccta 1860 
ccgcgtcacg cacgaaggag gcgtaggagt cgcgcagctt gttgaccagc tcggcggtga 192 0 
cctgcacgtc tagggcgcag tagtccaggg tttccttgat gatgtcatac ttatcctgtc 1980 
cctttttttt ccacagctcg cggttgagga caaactcttc gcggtctttc cagtactctt 2040 
ggatcggaaa cccgtcggcc tccgaacgag atccgtactc cgccgccgag ggacctgagc 2100 
gagtccgcat cgaccggatc ggaaaacctc tcgagaaagg cgtctaacca gtcacagtcg 2160 
caagatccaa gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 22 2 0 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 22 80 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 2340 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 24 0 0 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 2460 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 252 0 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 2580 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 2 640 
tagccctgca aacatcaggc cccctcacca ccaccgatag cagtaccctt actatcactg 2700 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gagcccattt 2760 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 2 82 0 
taaacacttt gaccgtagca actggtccag gtgtgactat taataatact tccttgcaaa 2 88 0 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgcaactt aatgtagcag 2 94 0 
gaggactaag gattgattct caaaacagac gccttatact tgatgttagt tatccgtttg 3 00 0 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 3 060 
acaacttgga tattaactac aacaaaggcc tttacttgtt tacagcttca aacaattcca 312 0 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacgct acagccatag 3180 
ccattaatgc aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 3240 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 33 0 0 
taggaactgg ccttagtttt gacagcacag gtgccattac agtaggaaac aaaaataatg 33 60 
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ataagctaac tttgtggacc ggtccaaaac cagaagccaa ctgcataatt gaatacggga 342 0 
aacaaaaccc agatagcaaa ctaactttaa tccttgtaaa aaatggagga attgttaatg 3480 
gatatgtaac gctaatggga gcctcagact acgttaacac cttatttaaa aacaaaaatg 3540 
tctccattaa tgtagaacta tactttgatg ccactggtca tatattacca gactcatctt 3600 
ctcttaaaac agatctagaa ctaaaataca agcaaaccgc tgactttagt gcaagaggtt 3 660 
ttatgccaag tactacagcg tatccatttg tccttcctaa tgcgggaaca cataatgaaa 3720 
attatatttt tggtcaatgc tactacaaag caagcgatgg tgcccttttt ccgttggaag 3780 
ttactgttat gcttaataaa cgcctgccag atagtcgcac atcctatgtt atgacttttt 3840 
tatggtcctt gaatgctggt ctagctccag aaactactca ggcaaccctc ataacctccc 3900 
catttacctt ttcctatatt agagaagatg actgattttt aagaagcggc cgctcgagtc 3 960 
tagagggccc gtttaaaccc gctgatcagc ctcgactgtg ccttctagtt gccagccatc 4020 
tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc ccactgtcct 4080 
ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt ctattctggg 414 0 
gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca ggcatgctgg 42 00 
ggatgcggtg ggctctatgg cttctgaggc ggaaagaacc snccntagct ggggctctag 42 60 
ggggtatccc cacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 43 2 0 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 43 80 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcggggca tccctttagg 4440 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4500 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 45 60 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4620 
ttttgattta taagggattt tggggatttc ggcctattgg ttaaaaaatg agctgattta 4680 
acaaaaattt aacgcgaatt aattctgtgg aatgtgtgtc agttagggtg tggaaagtcc 4 74 0 
ccaggctccc caggcaggca gaagtatgca aagcatgcat ctcaattagt cagcaaccag 48 0 0 
gtgtggaaag tccccaggct ccccagcagg cagaagtatg caaagcatgc atctcaatta 4860 
gtcagcaacc atagtcccgc ccctaactcc gcccatcccg cccctaactc cgcccagttc 492 0 
cgcccattct ccgccccatg gctgactaat tttttttatt tatgcagagg ccgaggccgc 4980 
ctctgcctct gagctattcc agaagtagtg aggaggcttt tttggaggcc taggcttttg 504 0 
caaaaagctc ccgggagctt gtatatccat tttcggatct gatcagcacg tgttgacaat 5100 
taatcatcgg catagtatat cggcatagta taatacgaca aggtgaggaa ctaaaccatg 516 0 
gccaagttga ccagtgccgt tccggtgctc accgcgcgcg acgtcgccgg agcggtcgag 522 0 
ttctggaccg accggctcgg gttctcccgg gacttcgtgg aggacgactt cgccggtgtg 52 8 0 
gtccgggacg acgtgaccct gttcatcagc gcggtccagg accaggtggt gccggacaac 534 0 
accctggcct gggtgtgggt gcgcggcctg gacgagctgt acgccgagtg gtcggaggtc 54 0 0 
gtgtccacga acttccggga cgcctccggg ccggccatga ccgagatcgg cgagcagccg 5460 
tgggggcggg agttcgccct gcgcgacccg gccggcaact gcgtgcactt cgtggccgag 552 0 
gagcaggact gacacgtgct acgagatttc gattccaccg ccgccttcta tgaaaggttg 5580 
ggcttcggaa tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg 5 64 0 
ctggagttct tcgcccaccc caacttgttt attgcagctt ataatggtta caaataaagc 5700 
aatagcatca caaatttcac aaataaagca tttttttcac tgcattctag ttgtggtttg 5760 
tccaaactca tcaatgtatc ttatcatgtc tgtataccgt cgacctctag ctagagcttg 5 82 0 
gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac 5880 
aacatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc 5 94 0 
acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg 6000 
cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 6060 
tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 6120 
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 6180 
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 624 0 
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 63 0 0 
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 63 60 
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 6420 
ctttctcaat gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg 6480 
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt 6540 
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg 6600 
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac 6 660 
ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 672 0 
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt 6780 
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt 6840 
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga 6900 
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc 6960 
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 7020 
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atctcagcga 
actacgatac 
cgctcaccgg 
agtggtcctg 
gtaagtagtt 
gtgtcacgct 
gttacatgat 
gtcagaagta 
cttactgtca 
ttctgagaat 
accgcgccac 
aaactctcaa 
aactgatctt 
caaaatgccg 
ctttttcaat 
gaatgtattt 
cctgacgtc 



tctgtctatt 

gggagggctt 

ctccagattt 
caactttatc 
cgccagttaa 
cgtcgtttgg 
cccccatgtt 
agttggccgc 
tgccatccgt 
agtgtatgcg 
atagcagaac 
ggatcttacc 
cagcatcttt 
caaaaaaggg 
attattgaag 
agaaaaataa 



tcgttcatcc 
accatctggc 
atcagcaata 
cgcctccatc 
tagtttgcgc 
tatggcttca 
gtgcaaaaaa 
agtgttatca 
aagatgcttt 
gcgaccgagt 
tttaaaagtg 
gctgttgaga 
tactttcacc 
aataagggcg 
catttatcag 
acaaataggg 



atagttgcct 
cccagtgctg 
aaccagccag 
cagtctatta 
aacgttgttg 
ttcagctccg 
gcggttagct 
ctcatggtta 
tctgtgactg 
tgctcttgcc 
ctcatcattg 
tccagttcga 
agcgtttctg 
acacggaaat 
ggttattgtc 
gttccgcgca 



gactccccgt 
caatgatacc 
ccggaagggc 
attgttgccg 
ccattgctac 
gttcccaacg 
ccttcggtcc 
tggcagcact 
gtgagtactc 
cggcgtcaat 
gaaaacgttc 
tgtaacccac 
ggtgagcaaa 
gttgaatact 
tcatgagcgg 
catttccccg 



cgtgtagata 
gcgagaccca 
cgagcgcaga 
ggaagctaga 
aggcatcgtg 
atcaaggcga 
tccgatcgtt 
gcataattct 
aaccaagtca 
acgggataat 
ttcggggcga 
tcgtgcaccc 
aacaggaagg 
catactcttc 
atacatattt 
aaaagtgcca 



7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
7989 



<210> 113 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 113 

gtcactcgag gactcggtcg actgaaaatg agacatatta tctgccacgg acc 

<210> 114 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 114 

cgagatcgat cacctccggt acaaggtttg gcatag 

<210> 115 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



53 



36 



<400> 115 

catgaagatc tggaaggtgc tgaggtacga tgagacc 

<210> 116 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



37 



<400> 116 

gcgacttaag cagtcagctg agacagcaag acacttgctt gatccaaatc c 



51 
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<210> 117 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 117 

cacgaattcg tcagcgcttc tcgtcgcgtc caagaccc 

<210> 118 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 118 

caccccgggg aggcggcggc gacggggacg gg 



